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Washington, D.C. 20231 

RESPONSE ANT) AMENDMENT UNDER 37 C.F.R. g 1.1 1 1 

Remarks 

Claims 58, 62, and 73-75 are pending in the subject application. The Office Action of 
November 19, 2001 has set forth a variety of rejections 



Rejection under 35 U.S.C. $ 101: 

The Office Action, dated November 19, 2001, has rejected claims 58, 62, and 73-75 under 
35 U.S.C. § 101 for allegedly lacking utility. Specifically, the Office Action alleges that the only 
taught utilities of the protein (encompassed by the present claims) are to detect the protein, to make 
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antibodies, and to screen drugs, and that while these utilities are credible, they are allegedly neither 
specific nor substantial (pages 2-5 of the November 19, 2001 Office Action). Applicants respectfully 
traverse this rejection. 

Applicants maintain that the utilities the Patent Office aclcnowledges to be credible in the 
Office Action are also specific and substantial In particular, in view of: a) the strong association 
between the genomic region encoding the present protein and schizophrenia; b) the fact that the 
present protein is expressed mainly in the brain; c) the fact that the protein is a novel member of a 
family of proteins frequently implicated in neuropsychiatric disorders; and d) G713 being a candidate 
gene for neuropsychiatric disorders, on the basis of both structure and function (see infra)\ 
AppHcants submit that the present protein has specific and substantial utility in the identification of 
individuals at risk for the development of neurodegenerative disorders or the treatment and diagnosis 
of neuropsychiatric disorders such as schizophrenia. 

Substantial utility 

The Office Action alleges that the cited utilities of the protein encompassed by the present 
claims that are provided in the specification are not substantial. Applicants respectfully traverse. 

]n particular, the Office Action states that evidence of the association of all glutamine repeat 
proteins with any disease state might be sufficient to provide a substantial utility. Applicants 
respectfully submit that this is not the standard for the establishment of the utility of an mvention. 
The evidentiary standard for application in this matter is "preponderance of the evidence". Thus, the 
issue is more properly framed as, "Is it more likely than unlikely that proteins containing 
polyglutamine repeats are associated wdth neurodegenerative diseases?" To this end, the Office 
Action indicates that: 

S:\S H-RESP\GEN\T 1 24 XVResponse- 1 8Feb02.doc/DNB/jaj 
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•'...many [genes] have glutamine repeats. A search of STN revealed that 
14,560 sequences in the database which had a 3 repeat sequence of CAGCAGCAG, 
of which 6 723 were associated with human sequences" (page 4, November 19, 2001 
Office Action). 

Applicants respectfully submit that this analysis is irrelevant to the question at hand. As recognized 
by the Patent Office, previously presented in arguments by Applicants' counsel, and presented 
herein, proteins containing glutamine repeats, and expansions of such repeats, are associated with at 
least twelve (12) diseases, including neurodegenerative diseases. Thus, Applicants respectfully 
submit that those skilled in the art are well aware of the relationship between glutamine repeats and 
neurological disorders. Furthermore, Applicants respectfully submit that, on the basis of its 
expression in brain and its CAG (glutamine) repeat (among other hallmarks of repeats known to 
undergo expansion in disease states), a person of ordinary skill in the art would "consider [G713] a 
candidate gene for neuropsychiatric disorder on the basis of both structure and function" [paragraph 
bridging pages 119-20; Margolis et al [1997] Human Genetics 100:114-122; Exhibit A]. 
Additionally, given the occurrence of sequential repeats of four glutamines, nine glutamines, and six 
glutamines within the G713 protein, Applicants respectfully submit that a person of ordinary skill in 
the art would perceive the use, by the Patent Office, of "a 3 repeat sequence of CAGCAGCAG 
[(CAG)3]" rather than a 9-19 repeat sequence of (CAG)9.i9 to be inappropriate for assessing the 
significance of the glutamine repeat within G713. 

It is further respectfully submitted that an absolute number of 6,723 hits for proteins 
containing a three repeat glutamine sequence is meaningless. In the context of this invention, it is 
the number and relationship of glutamine repeats with neurodegenerative disorders that are of 
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significance. Applicants respectfully submit that a more meaningful number is obtained from the 
paper by Margolis et ai (Exhibit A), wherein it was found that about 20 distinct cDNA clones were 
isolated from screening 200,000 or more plaques of human brain cDNA libraries with (CAG)i5. This 
corresponds to a representation of glutamine repeat within brain cDNA on the order of about 0.01%, 
a value inconsistent with the Patent Office's assertion that ''many [genes] have glutamine repeats" 
[emphasis added; page 4, November 19, 2001 Office Action]. Indeed, the skilled artisan would 
reasonably expect that genes and proteins containing high numbers of glutamine repeats are 
associated with neurodegenerative disorders on the basis of structure and function in neurological 
tissue (iVIargolis etai. Exhibit A, paragraph bridging pages 1 19-120). Thus, Applicants respectfully 
submit that a person of ordinary skill in the art would find reasonable Applicants' conclusion that 
G713 is suitable for screening as a marker for neuropsychiatric disorders on the basis of both 
structure and function. 

Applicants also respectfully point out that the M.P.E.P. at § 2 1 07.02 states, with reference to 

In re Langer: 

As a matter of Patent Office practice, a specification which contains a disclosure of 
utility which corresponds in scope to the subject matter sought to be patented must be 
taken as sufficient to satisfy the utility requirement of § 101 for the entire claimed 
subject matter unless there is a reason for one skilled in the art to question the 
objective truth of the statement of utility or its scope. 

Applicants respectfully submit that: 1) in view of the strong association between the genomic region 

encoding the present protein and schizophrenia; 2) in view of the fact that the present protein is 

expressed mainly in the brain; 3) in view of the fact that the protein is a novel member of a family of 

proteins frequently implicated in neuropsychiatric disorders; and 4) in view of G713 being a 

candidate gene for neuropsychiatric disorder on the basis of both structure and function, that there is 
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a correlation or association of the present protein with predisposition to the onset of neuropsychiatric 

disorders such as schizophrenia. As stated in the M.P.E.P. at § 2107.01: 

An assay that measures the presence of a material which has a stated correlation to a 
predisposition to the onset of a particidar disease condition [emphasis added] would 
also define a ''real world" context of use in identifying potential candidates for 
preventive measures or further monitoring. 

As one of the cited utilities of the protein encompassed by the present claims that are provided in the 

specification is to detect the protein (see supra). Applicants respectfully submit that the utilities, of 

the protein encompassed by the present claims, provided in the specification are substantial. 

Specific utility 

The Office Action also alleges that cited utilities, of the protein encompassed by the present 
claims, that are provided in the specification are "not specific because Perutz ([1996] Current 
Opinion Structural Biology- 6:848-58) ... has identified many different proteins with glutamine 
repeats, all of which are associated with different diseases and Kashima has identified at least one 
protein not disease associated with glutamine repeats" (page 4, November 19, 2001 Office Action). 

Applicants respectfully dispute the Patent Office's representation of the Perutz paper. In the 
first sentence, Perutz states: ''Several [emphasis added] dominantly inherited, late onset, 
neurodegenerative diseases [emphasis added] are due to expansion of CAG repeats, leading to 
expansion of glutamine repeats in the affected proteins." Perutz then proceeds to discuss oiAy seven 
neurodegenerative diseases. Furthermore, as indicated in Margolis et al, twelve (12) diseases, most 
with neurotrophic features, arise from trinucleotide repeat expansion mutations (see Abstract and 
Introduction). 
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Applicants further respectfully traverse the relevance of the Kashima paper. The observation 
by Kashima et ai that ''the consecutive glutamine repeats do not play a role in the biological and 
immunological activities of MIL-2" (page 4, November 19, 2001 Office Action) is irrelevant to 
examination of the present appHcation. Applicants further respectfully submit that as MIL-2 is not 
expressed in brain, MIL-2 is a priori irrelevant to any discussion of the role of CAG repeats, and 
their expansion, in neuropsychiatric disorders such as schizophrenia. Furthermore, Kashima et aL, 
and the teachings therein, are directed to the structure and function of murine intedeukin 2 (MIL-2) 
and contain no teachings with respect to neurodegenerative disorders. Thus, it is unclear what, if 
any, nexus or evidentiary value the reference has to the assessment of the role of glutamine repeats in 
neurodegenerative disorders. Furthermore, as the Office Action acknowledges, glutamine repeats, 
and their expansion, are art recognized to be associated with a variety of neurodegenerative 
disorders, such as schizophrenia. Accordingly, reconsideration and withdrawal of the rejection is 
respectfully requested. 
Rejection under 35 U.S.C. $ 1 12: 

In the Office Action dated November 1 9, 200 1 , the Patent Office rejected claims 58, 62, and 
73-75 under 35 U.S.C. § 1 12, first paragraph, as allegedly containing subject matter which was not 
described in the specification in such a way as to enable one skilled in the art to which it pertains, or 
with which it is most nearly connected, to use the invention. Specifically, the Office Action alleges 
that die cited utilities, of the protein encompassed by the present claims, that are provided in the 
specification allegedly are not specific. Applicants respectfiilly traverse this rejection. 
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Nature of Invention 

Claims are drawn to a G7 1 3 protein and methods of detection of this protein or gene product. 
The Office Action alleges that "the nature of this invention is a . . . protein, with no other associated 
information" [page 5-6, November 19, 2001 Office Action]. 

Applicants respectfully traverse this representation of the invention in view of: 1) the strong 
association between the genomic region encoding the present protein and schizophrenia; 2) the fact 
that the present protein is expressed mainly in the brain; 3) the fact that the protein is a novel 
member of a family of proteins frequently implicated in neuropsychiatric disorders; and 4) the fact 
that G713 is a candidate gene for neuropsychiatric disorder on the basis of both structure and 
function (see supra and infra). 
Breadth of the Claims 

As the Patent Office acknowledges, ''claims ... are drawn to a G713 protein and methods of 
detection of this protein or gene product'' [page 5, November 19, 2001 Office Action]. 
.Ajnount of Guidance in the Specification 

The Office Action alleges that the utilities of the protein encompassed by the present claims 
that are provided in the specification are "not found to be substantial nor specific and, consequently, 
the specification provides NO guidance regarding how to use this protein" [page 6, November 19, 
2001 Office Action]. Applicants respectfiilly disagree. As indicated 5wpra, the subject invention 
provides credible, substantial, and specific uses of the presently claimed invention. Applicants 
submit that the argument of a consequential lack of "guidance regarding how to use this protein", 
arising fi-om the allegation that the invention lacks specific and substantial utility, is improper and 
respectfully request reconsideration and withdrawal of this point of rejection. 
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Working Examples 

The Office Action also alleges that "the absence of working examples in the specification" 
[pages, November 19, 2001 Office Action] is a valid basis for a conclusion that ''undue 
experimentation would be required to use this invention as claimed" [page 8, November 19, 2001 
Office Action]. Applicants respectfully traverse this point of rejection and submit that a person of 
ordinary skill in the art would know how to use the present invention in an "assay for detection or 
diagnosis" [page 6, November 19, 2001 Office Action]. In support of Applicants' position, it is 
noted that just one type of assay encompassed by the method of claim 73, namely enzyme linked 
immunosorbent assays, is associated with at least 3307 references in PubMed [Exhibit B]. 

.\mount of Guidance in Prior Art 

The Office Action also argues that "the prior art provides no guidance with regard to the 
particular function of the G71 3 protein and does not even provide support or guidance for glutamine 
repeat containing proteins having a particular use or association" [page 6, November 19, 2001 Office 
Action]. It is respectfully submitted that this allegation is without basis in view of the art-recognized 
association of glutamine repeats with neurodegenerative diseases (see, for example, Margolis etai). 

The Patent Office also reiterates an argument based on the Kashima paper. As indicated 
supra, the relevance and evidentiary value of the Kashima paper to the instant invention is unclear. 
The reference concerns the role of glutamine repeats in the biological function of a murine cytokine 
and is devoid of any teachings related to glutamine repeats and their association with 
neurodegenerative disorders. Applicants further reiterate that in the context of the present invention, 
whether glutamine repeats play a role in the normal function of a given protein is irrelevant; as 
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explicitly acknowledged elsewhere by the Patent Office, high numbers of glutamine repeats, and 
their expansion, are hallmarks of neurodegenerative disorders and the proteins associated therewith. 
Skill in the Art 

The Patent Office believes that "the skill in the art would be considered high" [page 7. 
November 19, 2001 Office Action]. Applicants concur. 
Predictability of the Art 

The Patent Office cites a review article by Wright et al. ([2001] Schizophrenia Research 
47:1-12) to support his assertion of the "unpredictability in this linkage of schizophrenia with 
chromosomal locations" (page 1, November 19, 2001 Office Action). Applicants respectfully 
dispute the Patent Office's representation of linkage studies of schizophrenia as unpredictable. 
Genetic analysis of schizoplirenia and other diseases believed to have a number of susceptibility loci 
are complicated but not invalidated by issues of population stratification and statistical power. These 
considerations are well appreciated by persons of ordinary skill in the art and are discussed, for 
example, in Weinberger et al. ([2001] Biological Psychiatry 50:825-44; Exhibit C). 

The issue of population stratification is well known to Applicants as evidenced by the 
analysis presented in Example 2(g) of the specification (pages 164, 167, and 174). Example 2(g) 
presents the results of an association study between schizophrenia and the biallelic markers of the 
invention. In that study, it was found that "the relative difference in Mean normalized LD for BAG 
B5 is significantly higher when the comparison was made between familial cases and controls than 
when the comparison was made between the whole cases and the controls [emphasis added]" 
(page 167). 
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That the issue of population stratification is well known to persons of ordinary skill in the art 
is evidenced by the study presented in the paper by Brant et ai ([2000] Gastroenterology 
1 19: 1483-90; Exhibit D) dealing with genetic analysis of the analogously complex Crohn's disease 
(CD) and as discussed therein with reference to other studies. In the paper by Brant et ai, it was 
found that: "Pedigrees with CD diagnosed at an earlier age have greater linkage evidence for IBDl . 
This effect is greatly magnified when the individuals with younger onset also have relatively more 
severe disease.,,'' (page 1488) [emphasis added]. 

The issue of statistical power is well known to Applicants as evidenced by patents (for 
example, U.S. Patent No. 6,291,182; Exhibit E) and proprietary software directed to the statistical 
analysis involved in determining whether a genomic region is associated with a detectable trait such 
as schizophrenia. 

Applicants further dispute the Patent Office's assertion of "unpredictability in this linking of 
schizophrenia with chromosomal locations" (page 7, November 19, 2001 Office Action) in view of 
Applicants" success in further associating schizophrenia with gene g35030 (WO 01/40493; 
Exhibit?) and in view of others' success in identifying an associafion of catechol-o-methyl 
transferase (COMT) gene with schizophrenia [Weinberger et ai [2001] Biological Psychiatry 
50:825-44; Exhibit C]. 

In view of the foregoing evidence. Applicants respectfully submit that the Patent Office 
allegation regarding the "unpredictability in this linkage of schizophrenia with chromosomal 
locations" (page 7, November 19,2001 Office Action) is improper. Reconsideration and withdrawal 
of the rejection is respectfully requested. 
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Quantity of Experimentation 

The Office Action alleges that "an immense amount of experimentation would be required in 
order to define whether this protein is associated with any particular disease state" and concludes that 
"undue experimentation would be required to use this invention as claimed" (page 7, November 19, 
2001 Office Action). Applicants respectfully dispute this representation of the invention in view of 
the strong association between the genomic region encoding the present protein and schizophrenia, in 
view of the fact that the present protein is expressed mainly in the brain, in view of the fact that the 
protein is a novel member of a family of proteins frequently implicated in neuropsychiatric disorders, 
and in view of G7 1 3 structure and function that is suggestive of roles in neurodegenerative disorders. 
It is further submitted that that there is a correlation of the present protein to a predisposition to the 
onset of neuropsychiatric disorders such as schizophrenia (see supra). Applicants respectfully 
submit that the Patent Office has failed to establish a prima facie case regarding a lack of enablement 
and respectfully request reconsideration and withdrawal of the rejection. 
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In view of the foregoing remarks and amendments to the claims, Applicants believe that the 
currently pending claims are in condition for allowance, and such action is respectfully requested. 

The Commissioner is hereby authorized to charge any fees under 37 C.F.R. §§ 1 . 1 6 or 1 . 1 7 as 
required by this paper to Deposit Account No. 19-0065. 

Applicants invite the Examiner to call the undersigned if clarification is needed on any of this 
response, or if the Examiner believes a telephonic interview would expedite the prosecution of the 
subject application to completion. 



Respectfully submitted. 
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Evidence for Anticipation in Schizophrenia 

Anne S. Bassett* and William G. Honer^ 

•Department of P$ychiatf7, University of Toronto, and Genetics Section, Schizophrenia Research Program, Queen Street Mental Health Centre, 
Toronto; and ^Department of Psychiatry, University of British Columbia, Vancouver 



Summary 

Anticipation, or increasing severity of a disorder across successive generations, is a genetic phenomenon with an 
identified molecular mechanism: expansion of unstable trinucleotide repeat sequences. This study examined 
anticipation in familial schizophrenia. Three generations of siblines from the affected side of families selected 
for unilineal, autosomal dominant-like inheritance of schizophrenia were studied (n = 186). Across generations 
more subjects were hospitalized with psychotic illness (P<.0001), at progressively earlier ages (P<.0001), and 
with increasing severity of illness (P<.0003). The results indicate that anticipation is present in familial 
schizophrenia. These findings support both an active search for unstable trinucleotide repeat sequences in 
schizophrenia and reconsideration of the genetic model used for linkage studies in this disorder. 



Introduction 

The clinical observation of anticipation — i.e., inherited 
illness that becomes more severe across successive gen- 
erations — has recently been found to have a molecular 
basis: expanding GC-rich trinucleotide repeat sequence 
mutations (Harper et al. 1992; Sutherland and Richards 

1992) . In fragile X syndrome (Verkerk et al. 1991), 
myotonic dystrophy (Fu et al. 1992), spinobulbar mus- 
cular atrophy (Brook et al. 1992), spinocerebellar atro- 
phy type 1 (Orr et al. 1993), and Huntington disease 
(Huntington's Disease Collaborative Research Group 

1993) , increasing severity of illness, earlier age at onset, 
and/or increasing proportion of ill individuals in suc- 
cessive generations are associated with longer trinucleo- 
tide expansions. Schizophrenia is another neuropsychi- 
atric disorder that may display this anticiparion 
phenomenon and that therefore may have familial 
forms caused by an unstable trinucleotide repeat. 

Schizophrenia is a severe disorder characterized by 
social withdrawal and psychotic symptoms, such as de- 
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lusions and hallucinations. The illness has a variable age 
at onset, often beginning in early adulthood and result- 
ing in lifelong disabilities in social and occupational 
functioning. Evidence from family, twin, and adoption 
studies, including those using reliable diagnostic crite- 
ria (Lowing et al. 1983; Kendler et al. 1985), strongly 
suppons a genetic etiology for schizophrenia (Gottes- , 
man and Shields 1982). However, the mode of inheri- 
tance for schizophrenia is not readily identifiable and is 
proposed to involve interacting genes (Risch 1990). In 
families with the illness, reduced penetrance and vari- 
able expression are commonly found. Other psychoric 
disorders, of lesser severity, and schizotypal personality 
traits such as social isolation, odd communicarion, and 
extreme suspiciousness are conditions likely reflecting 
variable expression of genetic suscepribility to schizo- 
phrenia (Gottesman and Shields 1982; Lowing et al. 
1983; Kendler et al. 1 9 85). These factors, along with the 
possibility of genetic heterogeneity and the pracrical 
difficulties of studying an illness with a significant sui- 
cide rate and suspicious, socially isolated individuals, 
combine to make schizophrenia a challenging disorder 
for linkage studies (Bassett 1991), Strategies to over- 
come these difficulties include focusing on familial 
schizophrenia where inheritance is consistent with 
Mendelian patterns, using reliable diagnostic methods, 
highly polymorphic DNA markers, and lod-score 
methods that model the complexities of the inheri- 
tance. Linkage studies to date, using informative fami- 
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lies and models based on Mendelian inheritance, have 
yielded significant lod scores but no replicated positive 
results (Bassett 1991). 

Dynamic modifications of classical patterns of ge- 
netic transmission, such as anticipation (Mott 1911) 
and genomic imprinting (differential expression of ge- 
netic material depending on parental origin of the gene 
and the underlying molecular mechanisms), may ex- 
plain the complex genetics of schizophrenia and other 
major mental illnesses (Mclnnis et al. 1991; Flint 1992). 
For example, trinucleotide repeats can cause reduced 
penetrance and variable expressionY^askey et al. 1992; 
Sutherland and Richards 1992), by existing in a premu- 
tation form, by reductions in repeat size, or by somatic 
mutation in early embryogenesis (Lavedan et al. 1993). 
The unstable nature of trinucleotide repeats also pro- 
vides a possible mechanism for the high mutation rates 
proposed for schizophrenia^later and Cowie 1971). If 
there were evidence of anticipation in schizophrenia, 
screening for triplet repeat mutations would become a 
rational option for gene localization studies, and modi- 
fication of the genetic model used in linkage studies 
would need to be considered. The current study inves- 
tigated whether anticipation was present in a familial 
schizophrenia sample participating in a linkage study. 

Subjects and Methods 

Subjects were members of eight extended noncon- 
sanguineous families participating in a genetic linkage 
study of familial schizophrenia. Local psychiatrists 
identified prospective pedigrees segregating schizophre- 
nia. Families were selected for large size, availability of 
two or more generations of adults, and apparent unilin- 
eal, autosomal dominant-like inheritance of schizo- 
phrenia and genetically related disorders. Bilineal fami- 
lies with evidence, from family or collateral history, of 
schizophrenia or other nonafFective psychotic dis- 
orders on both sides were excluded. Further details of 
the original ascertainment and assessment for the link- 
age study are described elsewhere (Bassett et al. 1993). 
Since families were ascertained in their entirety, pro- 
band status could be assigned to all affected subjects. 
Therefore no subjects were excluded from analyses. 

Subjects only from the affected side of each family 
were taken into account, to determine sibling sets (sib- 
lines) in the index generation (IG) {n = 13 siblines), 
parental generation (PG) (n = 10 siblines), and grand- 
parental generation (GG) (n = 8 siblines). The affected 
side was defined by (1) a parent hospitalized with psy- 
chosis (four cases), (2) an aunt/uncle hospitalized with 



psychosis (four cases), (3) a parent/aunt/uncle with 
schizotypal traits (seven cases), or (4) a sibship linking 
two affected nuclear families (six cases in two extended 
families) (see fig. 1). In two cases in the GG, the aflPected 
side could not be determined, and the smaller of the 
unaffected maternal or paternal siblines was arbitrarily 
selected. There were three instances of unknown pater- 
nity. In two of these cases, the maternal line was af- 
fected. In the third case, neither the mother nor her five 
siblings were affected, and this GG sibline was not in- 
cluded in the analysis. 

Family-history information was obtained for each 
subject from three or more family members by using 
the Family History-Research Diagnostic Criteria (FH- 
RDC) method (Andreasen et al. 1977). Genealogical re- 
cords were used to confirm dates of birth and death. 
Medical records were searched back to 1866 for evi- 
dence of psychiatric hospitalization. Because the sub- 
ject familieis originated and seldom moved from a cir- 
cumscribed region of Canada, and because the one 
psychiatric hospital available until the 1980s consis- 
tently maintained a comprehensive file-card system of 
recording admissions, virtually complete ascertainment 
of psychiatric hospitalization was assured. Records 
were collected for all subjects with a history of psychiat- 
ric hospitalization. Living subjects participating in the 
linkage study were directly interviewed by a psychiatrist 
(Bassett et al. 1993). Diagnostic folders containing the 
family history and, if present, medical records and in- 
terview data were reviewed independently by two psy- 
chiatrists (A.S.B. and W.G.H.), one of whom (W.G.H.) 
was blind to the pedigree structures. A consensus life- 
time Research Diagnostic Criteria (RDC) diagnosis for 
psychotic disorders, age at first hospitalization for a 
psychotic illness, and presence of two or more RDC 
schizotypal traits were recorded. Psychoric disorders 
included schizophrenia (n - 25), schizoaffective dis- 
order (« = 13; 12 mainly schizophrenic type and 1 
other), unspecified functional psychosis {n = 4), mania 
with psychosis {n = 1), and depression with psychosis (« 
= 1). Schizophrenia and schizoaffective disorder were 
approximately equivalent in severity in the current sam- 
ple (Bassett et al. 1993) and were considered together in 
the current study. Individuals with psychotic disorders 
not severe enough to require hospitalization, as well as 
subjects with schizotypal traits, were combined in a 
single schizotypal group, because of small numbers in 
each category. 

Of the 209 subjects in the affected siblines, 23 were 
excluded from the analyses. One IG subject had not 
attained the age of 15 years, considered a minimum age 
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Figure I Three pedigrees of the eight fam- 
ilies studied, illustrating anticipation in familiai 
schizophrenia. IG. PC, and GG siblines on the 
affected side arc shown. The numbers below indi- 
viduals indicate their age at first hospitalization 
for psychotic illness. An unblackencd square (□) 
denotes an unaffected male; an unblackened cir- 
cle (O) denotes an unaffected female; a black- 
ened square (■) or circle (•) denotes hospitaliza- 
tion for a psychotic disorder and a 
half-blackened square (CI) or circle (O) denotes 
schizotypal conditions. Sex and birth order of 
some individuals have been changed to protect 
confidentiality. A slash (/) through the symbol 
denotes that the individual is deceased. The box 
outlines a single affected lineage from a family 
connected at the grandparcntal level. 
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of nsk for psychotic illness (Gottesman and Shields 
1982). Five subjects (2 PG and 3 GG) had moved, and 
collateral information was insufficient to determine 
hospitalization status; and 17 subjects (3 IG, 5 PG, and 
9 GG) died before the age of 40 yean. Thirteen died in 
infancy or childhood, two in war, one in an accident at 
work, and one of unknown cause; none were suicides. 
Most new cases of schizophrenia may be expected be- 
fore age 40 years (Gottesman and Shields 1982). Data 
on the remaining 186 subjects were examined for antici- 
pation, in three ways. First, the rates of hospitalization 
for psychotic disorders and the rates of schizotypal 
conditions were compared across generations by using 
X analyses. Second, to assess severity of illness, su bjects 
were assigned the following ratings: hospitalized with 
schizophrenia or schizoaffective disorders— 3; hospital- 
ized with other psychotic disorders— 2; schizotypal— 
1; and unaffected— 0. Means for each generation were 
compared using the one-way analysis of variance (AN- 
O VA), including correction for multiple tests of signifi- 
cance with the Student-Newman-Keuls procedure. 
Third, age at first hospitalization for psychosis was as- 
sessed using the life-table method of survival analysis 
for 1-year intervals. Homogeneity of survival curves 
over the generations was examined using the Wilcoxon 



test. The analysis was performed assuming (1) no differ- 
ential mortality between affected and unaffected and 
(2) hospitalization rates independent of chronological ' 
time (e.g., 1920 vs. 1970). Observations ended at the 
subject's current age or age at death. Covariates tested 
were sex and transmission patterns (matemal/patemai). 

Results 

Demographic characteristics of the sample and re- 
sults indicating anticipation are presented in table 1, As 
for other illnesses demonstrating anticipation, expres- 
sion of illness varied between members of a sibship 
(fig. 1). 

Rotes and Distribution of Illness 

There were significantly more subjects hospitalized 
for psychosis across successive generations (x* = 16.84, 
P<.0001, 2 df). Most had schizophrenia or schizoaffec- 
tive disorders (IG, n = 30; PG, « = 8; and GG, n = 0). Of 
the six subjects with less severe disorders— unspecified 
functional psychosis (« = 4), psychotic mania (« = 1), or 
psychotic depression (n = 1)— four were in PG or GG. 
Subjects with the least severe illnesses (schizotypal con- 
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Table I 



Characteristics of the Sample, by Generation (n = 1 86) 




IG 


PG 


GG 


Total no. of subjects (females) 


86 (38) 


62 (24) 


38 (21) 


Mean size of sibline (SD) 


6.61 (4.27) 


6.20 (3.61) 


5.43 (4.20) 


Mean age of living* (SD) 


40.61 (8.74) 


65.55 (12.96) 


71.80 (8.26) 


Mean age ar death'' (SD) 


44.25 (3.20) 


57.15 (14.58) 


77.35 (9.34) 


Mean age at first hospitalization (SD) 


26.16 (8,28) 


34.00(17.28) 


41.33 (15.89) 


Subjects hospitalized for psychotic illness . . 


32(37.21%) 


9(14.52%) 


3 (7.89%) 


Subjects with schizotypal conditions 


8(9.30%) 


8 (12.90%) 


8(21.05%) 



" No. of living subjects: IG. 82; PG, 42; and GG, 10. 

** No. of dead subjects with known age at death: IG, 4; PG, 20; and GG. 20. Eight other GG subjects had 
ages at death that were less precisely known, e.g.. "in their 70s." The mean shown did not change signifi- 
cantly when these subjects were included using estimated ages at death. 



ditions) were twice as common in the GG as in the IG 
(21.05% vs. 9.30%); however, across the three genera- 
tions, the result was not significant (x^ = 3.24, P = .20, 
2 dO. When these subjects with schizotypal conditions 
were included with hospitalized subjects, there were 
still significantly more aflFected subjects in the youngest 
generation (x^ = 6.86, P = .032, 2 df). 

Because of the possibility of compounded error with 
families being connected at the parental or grandparen- 
tal level, analyses were rerun with a single affected lin- 
eage from each of the seven kindreds that had data 
from all three generations, with the largest siblines be- 
ing selected (see fig. 1). The same results were found 
using this subsample of 151 subjects, for both hospital- 
ization rates (x^ = 12.30, P<.002, 2 df) and hospitaliza- 
tion plus schizotypal rates (x^ = 9.46, P<.009, 2 df). 
Excluding subjects who died before age 40 or moved 
away (n = 22) could have influenced the study's find- 
ings, since most were in the two senior generations. 
When results with these subjects and with penetrance 
estimated maximally at 100% were considered, so that 
one-half (« = 11) would have been hospitalized with 
psychotic disorders, the results for hospitalization rates 
would still have remained significant (x^ = 10.02, 
P<.01, 2 df). Data were also reanalyzed examining the 
effect that increasing hospitalization rates over time 
would have on the results. Secular trends of hospitaliza- 
tion rates for specific illnesses were not available. There- 
fore, arbitrary increases were tested, by adding 100% 
more subjects (n ~ 3) to the hospitalized GG group and 
50% more {n = 4.5) to the PG group. Under these con- 
ditions, a significant increase in the rate of psychosis 
requiring hospitalization would continue to be present 
oyer the generations (x^ = 7.75, P<.05, 2 dO. 



SevehtY of Illness 

Means for four-point severity-of-illness ratings for 
the three generations were as follows: IG (n - 86), 1.19; 
PG (« = 62), 0.58; and GG (n = 38), 0.37. Severity of 
illness significantly increased over the generations (F 
= 8.39, 2 df, P = .0003). Pairwise comparisons using 
the Student-Newman-Keuls test revealed that the signif- 
icant difference was between the IG and the PG and 
GG. PG and GG severity means were not significantly 
different from each other. 

Age Qt First Hospitalization 

The survival curve (fig. 2) shows that subjects were 
first hospitalized for psychosis at progressively younger 
ages, across generations (x^ = 26.76, P = .0001, 2 df). 
Sex of subject was not a significant covariate to genera- 
tion, for age at first hospitalization (increment x^ 
= 0.40, P = .53, 1 df). All of the GG and most (68%) of 
the PG subjects had achieved an age of 55 years or 
more, by which time they would have been virtually 
through the age at risk for schizophrenia (Gottesman 
and Shields 1982). 

Imprinting 

There were equal rates of maternal and paternal 
transmission (six and seven cases, respectively) from the 
PG to the IG. The mean age at first hospitalization for 
maternal transmission (PG to IG) was 25.00 years (SD 
5.94 years), and that for paternal transmission was 
27.64 years (SD 10.63 years), a nonsignificant difference 
{t - .83, P = .41). Grandparental to parental transmis- 
sion was predominantly maternal (seven of nine cases), 
with one unknown and only one example of paternal 
transmission. Sex of transmitting parent was not a sig- 



868 



Bassett and Honer 




Ago (years) 



Figure 2 Survival curves for age at first hospitalization for 
psychotic illness, comparing IG (A), PC (•), and GG (■). Annual 
percent surviving without hospitalization is plotted for every 5th 
year. Observation of individuals ended at theirage of death or, if they 
were living, at their current age. 

nificant covariare in the survival analysis examining age 
at first hospitalization (increment = 0 32 P = ^7 
2df). r o/. 

Discussion 

The results suggest that familial schizophrenia ex- 
hibits anticipation. All of the families studied showed 
this phenomenon, manifest as increasing rates of hospi- 
talized psychotic illness, worsening severity of illness 
and/or earlier age at onset, across successive genera- 
tions (fig. 1). These findings are consistent with differ- 
ences in rates of hospitalization and age-at-onset data 
for parent-child pairs in studies of schizophrenia over 
the century (Mott 1910; Kay 1963; Penrose 1971- De- 
cina et al. 1991). In each of these studies, rates of hospi- 
talization for psychosis were lower for antecedent gen- 
erations, and age at onset for parents was significantly 
later than that for offspring. Investigations of ancestors 
and extended families also support these findings 
(Karlsson 1966; Odegaard 1972; Wetterbem and 
Farmer 1991). As well, less severe psychotic illnesses 
(e.g., affective disordere) are consistently more com- 
mon in the generation antecedent to schizophrenic 
probands (Slater and Cowie 1971; Bleuler 1978) These 
results complement reportedmorbid risk of schizophre- 
nia for parents, which is almost half that for siblings 
(Gottesman and Shields 1982). While alternative rea- 
sons, including selection biases such as reduced fertility 



in earlier-onset schizophrenia, have been proposed to 
explain these clinical observations, they are all consis- 
tent with the phenomenon of anticipation in schizo- 
phrenia. 

In contrast, findings from the current study are only 
suggestive of sex-specific differences in the transmis- 
sion of schizophrenia. However, the possibility of an 
excess maternal over paternal transmission in schizo- 
phrenia IS consistent with trends found both recently 
by others (Sharma et al. 1993) and in studies of large 
data sets in the older literature (Penrose 1971- Slater 
and Cowie 1971). Fertility may be especially low in 
male patients with schizophrenia (Gottesman and 
Shields 1982), and in the current study this could be the 
reason for the high rate of maternal transmission from 
the GG lines to the PG lines. If this were the case 
however, one would have expected predominantly ma- 
terna inheritance from the PG to the IG lines; but ma- 
ternal and paternal rates were equal. Comparable trans- 
mission patterns have been found to be due to greater 
variation of trinucleotide repeat length after female 
meiosis in myotonic dystrophy (Lavedan et al. 1993) 
The effect may be subtle, requiring larger samples to 
demonstrate imprinting in schizophrenia. 

There are several possible biases that can explain re- 
sults that indicate anticipation (Penrose 1948). First 
subjects could have died before expressing the muta- 
tion. However, all of the GG and most of the PG lived 
beyond age 55 years. Also, significantly different hospi- • 
talization rates across generarions remained in the 
current study, even when half of those who had moved 
or died before age 40 years were assigned affected sta- 
tus. Second, reduced fertility of individuals with earlier 
onset of schizophrenia could cause preferential ascer- 
tainment of parents with later onset. This bias should 
be minimized m the current study, because (1) few par- 
ents were affected with psychosis and (2) siblines were 
large in all three generations, providing multiple oppor- 
tunities for detection of affecteds, regardless of their 
fertility. Third, subjects in the IG could have been too 
young to yet express a late-onset form of psychosis, 
which would attenuate the age-at-onset findings. Since 
one-half of the subjects in the IG were over age 40 
years, most were beyond the period of highest risk, 
tven If more new cases of psychosis subsequently 
arose, this would most likely occur in the IG and would 
only serve to strengthen the present study's findings 
with respect to differential rates of illness. 

In contrast to major depression (Gershon etal. 1987) 
there IS no evidence for a cohort effect in schizophre- 
nia. However, secular trends, such as improved detec- 
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tion, that could, over time, lead to higher hospitaliza- 
tion rates and/or younger age at first hospitalization 
for psychosis are important to consider, since these 
could have influenced the principal findings of the 
current study. In the literature (Mott 1910; Kay 1963; 
Penrose 1971; Decina et al. 1991), examination of ages 
at first hospitalization did not reveal secular trends to 
younger age over the century. In the current study, ar- 
bitrary increases in hospitalization incidence assigned 
to the PG or GG did not change the observation of 
anticipation. Specific factors that may influence secular 
trends in hospitalization, including drug abuse and psy- 
chosocial stressors, do not appear to have played a role 
in the sample studied. Only three hospitalized subjects 
in the IG had a history of stimulant or hallucinogen use, 
drugs that may in some cases precipitate a psychotic 
illness. On the basis of direct interviews, it appeared 
that psychosocial stressors endured by PG and GG, 
such as the World Wars and Great Depression, v^ere 
more severe than those faced by the IG. 

Another possibility is that the family-history method 
tends to underestimate rates of psychiatric disturbance, 
particularly in relatives who are dead or less known 
(Andreasen et al. 1977). The consequence could be that 
actual rates for schizotypal traits could be higher than 
those found, particularly for PG and GG. However, 
results for severe illness requiring hospitalization would 
remain unchanged. Another factor that could have 
compounded errors in the PG and GG was the use of IG 
siblines connected at the parental or grandparental 
level. However, both the fact that results remained the 
same when only one ascending line from each kindred 
was examined and the fact that at least 13 aflfected 
siblines had resulted from eight originating GG lines 
support the finding of anticipation. 

A limitation that is important to consider in studies 
of common illnesses is the potential for assortative 
mating to cause an apparently increased prevalence of 
illness in offspring. This possibility was minimized by 
selecting unilineal pedigrees with no evidence of schizo- 
phrenia or related disorders in the married-in person, 
their siblings, or parents. Although individuals marrying 
in could have been nonexpressing carriers of the dis- 
order with nonexpressing close relatives, the likelihood 
appears small that assortative mating could account for 
the results in the current study. Because the families 
studied were selected because of their autosomal domi- 
nant-like inheritance and large sibships, the results may 
not be generalizable to schizophrenia in the general pop- 
ulation, although they are consistent with observations 
from large population-based samples {Penrose 1971; 



Bleuler 1978). Also, the schizophrenia in the subject 
families may be a particularly severe form, which could 
exaggerate the findings. However, the mean age at on- 
set for the IG is similar to others' results (Mott 1911; 
Gottesman and Shields 1982; Decina et al. 1991; 
Sharma et al. 1993). Data on specific symptom patterns 
suggest that the familial schizophrenia in the present 
families is comparable in nature and in severity to sam- 
ples drawn from the general population (Bassett et al. 
1993). 

Despite the possible biases and limitations, the 
weight of evidence from both the current investigation 
and the literature is consistent with the finding of antici- 
pation in familial schizophrenia. Other families with 
schizophrenia should be examined for anticipation and 
possible accompanying maternal imprinting phenom- 
ena, to confirm the current study's findings. Although 
published pedigrees consistently show evidence of an- 
ticiparion (Karisson 1966; Wetterberg and Farmer 
1991), complete ascertainment of other large kindreds 
with contemporary reliable diagnosric assessments 
would be useful. However, the most exciting possibili- 
ties—and the confirmanon of the clinical observations 
of the current study— lie in the search for expanding 
trinucleotide repeats and other DNA sequence muta- 
tions in schizophrenia. New methods becoming avail- 
able to detect these mutarions (Orr et al. 1993; Schal- 
ling et al. 1993) will complement and may accelerate the 
search for pathological genes in linkage studies. In ad- 
dition, the current study has implications for the ge- 
netic model used in linkage studies. Parameters, particu- 
lariy penetrance, which would vary according to 
generarion, may need to be modified to reflect the ef- 
fects of anticipation and possibly imprinring. In the 
light of clinical evidence for ^nricipation, these strate- 
gies represent real promise for deciphering the genetics 
of schizophrenia. 
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Abstract ^ 

Protein methylation is a posttranslational modification that can potentially regulate signal transduction 
pathways in a similar manner as protein phosphorylation. The role of protein methylation in NGF signaling 
was examined by metabolic labeling of PC12 cell proteins with L-[methyl-^H]methiomne and by in vitro 
labeling of cell proteins with L-[methyl-^H]S-adenosylmethionine. Effects of NGF were detected within 15 
min. Methyl-labeled proteins were resolved by one and two dimensional SDS-PAGE. NGF affected the 
methylation of several 68-60-kD proteins (pi 5.8-6.4) and 50-kD proteins (isoelectric point pH 6.7-6.8 and 
5.8-6.2). Several NGF-induced changes in methylation increased over several hours and through 4 d. 
Moreover, methyl labeling of several specific proteins was only detected after NGF treatment, but not in 
nontreated controls. The effects of NGF on protein methylation were NGF specific since they were not 
observed with EGF or insulin. A requirement for protein methylation for neurite outgrowth was 
substantiated with either of two methylation inhibitors: dihydroxycyclopentenyl adenine (DHCA) and 
homocysteine. DHCA, the more potent of the two, markedly inhibits protein methylation and neurite 
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outgrowth without affecting cell growth, NGF-induced survival, cell flattening, or several protein 
phosphorylations that are associated with early signaling events. Removal of DHCA leads to rapid protein 
methylation of several proteins and concurrent neurite outgrowth. The results indicate that NGF regulates 
the methylation of several specific proteins and that protein methylation is involved in neurite outgrowth 
from PCI 2 cells. 



PROTEIN methylation is a posttranslational modification that may be used to regulate signal transduction 
and differentiation pathways by mechanisms that are analogous to regulation by protein phosphorylation 
(Hrycyna and Clarke, 1993B; Rando, 1996H). Although a specific role for protein methylation in 
prokaryote chemotaxis is well established (Shapiro et al, 19950), possible roles for protein methylation in 
eukaryotic signaling mechanisms have not been extensively explored. Protein carboxyl methylations are 
reversible, and regulatory roles for carboxyl methylation have been proposed for chemoattractant responses 
in neutrophils (Philips et al., 19930, 19950), insulin secretion from pancreatic islets (Metz et al., 19930), 
and photoreceptor signal transduction (Parish et al., 19950). Among signaling proteins that are known to'be 
carboxyl methylated are the Ras and Rho family of small G-proteins (Hrycyna and Clarke, 1993Et Rando, 
199QI1), y subunits of heterotrimeric G-proteins (Philips et al., 19930; Rando, 19960), and the catalytic 
subunit of protein phosphatase 2A (Lee and Stock, 19930 Favre et al., 19940 Xie and Clarke, 19940). 
Protein phosphatase 2A is also demethylated by a specific protein carboxyl methylesterase (Lee et al., 1996 
IS). Many proteins are known to be N-methylated at arginine, lysine, or histidine residues including 
cytoskeletal proteins (actin and myosin), nuclear proteins (nucleolin, fibrillarin, histones, heterogeneous 
nuclear RNPs), the multifunctional calcium binding protein, calmodulin, and FGF-2. The physiological 
fiinctions of methylation, however, remain largely unknown. Moreover, potential mechanisms for 
regulating protein methylation within growth factor signaling pathways remain to be explored. 

Methylation pathways use S-adenosylmethionine (SAM)» as the universal methyl donor for 
methyltransferase-catalyzed methylation of proteins and other methyl acceptors. The role of methylation in 
NGF signal transduction was previously studied using high concentrations (millimolar) of inhibitors of 
methyltransferases that use SAM (Seeley et al., 19840; Kujubu et al., 19930). These attempts to examine 
the role of methylation in cell signaling were somewhat compromised by the lack of specific, nontoxic 
inhibitors of methylation. In addition to the methylated acceptor, S-adenosylhomocysteine (SAHcy) is a 
product of all methyltransferase reactions. SAHcy is a strong competitive inhibitor of SAM (Hildeshein et 
al., 19720) and is normally removed by hydrolysis in a reversible reaction (De la Haba and Cantoni, 
1959±) catalyzed by S-adenosylhomocysteine hydrolase (SAHH). Thus, inhibition of SAHH offers an 
alternative means to inhibit methyltransferases. This approach was used in the present work to study 
protein methylation during NGF signaling. The crucial role of SAHH and methylation in development is 
evident from the nonagouti (a^) mutation. The lethality of this mutation is due to a deletion of the SAHH 
gene, and embryonic development is arrested in the preimplantation blastula stage (Miller et al., 19940). 

The PCI 2 clonal cell line was used to examine the role of protein methylation in NGF signal transduction. 
PC 12 cells are derived from a rat pheochromocytoma and serve as a model of neuronal differentiation. 
When exposed to NGF, PC12 cells assume many of the features of sympathetic neurons including cell. 
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cycle arrest, survival in serum-free medium, and elaboration of long neurites (Greene and Tischler, 1976[±t 
Tischler and Greene, 19780). Moreover, PC 12 cells also respond to other growth factors. For example, in 
the presence EGF, PCI 2 cells do not differentiate into neuronlike cells, but do increase their mitotic rate 
(Huff and Guroff, 1981). Thus, PC 12 cells are useful to study specific growth factor signaling mechanisms. 
The specific signaling events and proteins involved in neurite outgrowth have not been fiilly elucidated. For 
example, NGF and EGF both activate several signaling proteins including phospholipase C, 
phosphoinositide (PI)3-kinase, extracellular signal-related kinase (ERKs) (MAP kinases), and Ras (Kaplan 
and Stephens, 1994|±t Marshall, 19950). However, NGF, but not EGF, stimulates neurite outgrowth in 
PC 12 cells. One hypothesis for the specificity of NGF actions involves kinetic differences in the activation 
of both Ras and ERK (Marshall, 19950). NGF stimulation of PC 12 cells results in prolonged activation of 
both Ras and ERK activity, while EGF causes a transient rise and fall in Ras and ERK activities (Qui and 
Green, 19920). Specificity may also arise from NGF-specific signaling pathways that are not activated by 
EGF and other mitogens (Chao, 19920 Kaplan and Stephens, 19940 Peng et al., 19950). The results 
reported here indicate that signaling pathways involving protein methylation may contribute to NGF 
specificity. 

Previous studies by Seeley et al. (1984)0 have implied that methylation is necessary for neurite outgrowth 
in PC12 cells. Kujubu et al. (1993)0 and Haklai et al. (1993)0 reported that methylation of small 
G-proteins is regulated by NGF in PC12 cells. Najbauer and Aswad (1990)0 have also characterized 
several methyl-arginine containing proteins in PC12 cells. The effects of NGF on protein methylation and 
the effect of inhibition of protein methylation on neurite outgrowth are, however, incompletely 
characterized and have been di fficult to interpret due to the lack of specificity of the methylation inhibitors 
used in previous studies. The present work extends previous findings by examining the methylation of 
specific cellular proteins and neurite outgrowth before and after inhibition of the protein methylation 
pathway. 9-{trans-2', /ran5-3'-dihydroxycyclopent-4-enyl)-adenine (DHCA), a specific, mechanism-based 
inhibitor of SAHH, inhibits protein methylation and greatly decreases neurite outgrowth. The results 
indicate that NGF, but not EGF or insulin, regulates the methylation of several specific proteins and that 
protein methylation is required for neurite outgrowth from PC 12 cells. 



Materials and Methods ^ 
Reagents 

NGF was purified from male mouse submaxillary glands as described previously (Mobley et al., 19760). 
Lisulin, L-homocysteinethiolactone, anisomycin, leupeptin, PMSF, aprotinin, and monoclonal 
phosphotyrosine antibody (clone PT-66) were purchased from Sigma - Chemical Co. (St. Louis, MO). 
Homocysteine was prepared by incubating L-homocysteine thiolactone with 40 mM NaOH at 37°C for 
30 min. EGF was obtained from Upstate Biotechnology hic. (Lake Placid, NY). DHCA (kindly provided by 
R.T. Borchardt, University of Kansas, Lawrence, KS) was prepared as 1 and 100 mM stock solutions in 
DMSO. Erythro-9-(2-hydroxy-3-nonyl) adenine was kindly provided by D. Porter (Burroughs Wellcome, 
Research Triangle Park, NC). L-[methyl-3H]methionine (71 .4 Ci/mmol), L-[methyl- ^HjSAM 
(70 Ci/mmol), ['"'CJadenosine (59.8 mCi/mmol), and carrier-free [^^pjorthophosphate were purchased 
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■ from Dupont-NEN (Boston, MA). l^Sj.donkey anti-rabbit IgG was purchased from Amersham Corp 
(Arlington Heights, IL). Protein A-Sepharose (PAS) and ampholines were obtained from Pharmacia Fine 
Chemicals (Piscataway, NJ). Donor horse serum and fetal bovine serum were from JRH Biosciences 
(Lenaxa, KS). Cell culture medium, penicillin, and streptomycin were obtained from Gibco Laboratories 
(Grand Island, NY). Restriction endonuclease Mspl was from New England Binhh.Tno (Beverly, MA) 
and nuclease P 1 from United States Biochemical Corp. (Cleveland, OH). 

Cell Culture and Bioassays 

Stock cultures of PC12 cells were grown on collagen-coated tissue culture dishes (Falcon Plastics 
Cockeysville, MD) in RPMI- 1640 medium supplemented with 10% heat-inactivated horse serum 'and 5% 
fetal bovine serum at 35°C and 7.5% CO2, as previously described (Greene et al., 1987EI). For neurite 
outgrowth studies, PC12 cells (3-5 x 10^ cells) were plated onto collagen-coated 35-mm plastic tissue 
culture dishes and cultured in RPMI-1640 medium containing 1% heat-inactivated horse serum and 
50 ng/ml NGF. Neurites >20 m were counted as neurite-bearing cells. At least 100 cells from each 
expenmental condition were scored from randomly chosen fields. 

For the study of proliferation and survival in the presence and absence of DHCA, naive PC12 cells were 
plated on collagen-coated, 24-well tissue culture dishes at a density of 1 0^ cells per well. 1 d after plating 
cells were washed three times in RPMM640 medium to remove serum and then placed either in complete 
medium (RPMI plus 10% horse serum and 5% fetal bovine serum), in RPMI plus 50 ng/ml NGF or in 
RPMI alone. Where indicated, 1 ^M DHCA was included in the culture medium. The number of viable 
cells was determined by counting intact nuclei using a hemacytometer (Soto and Sonnenschein 1985H) 
immediately after washing the cultures with RPMI at time intervals of 1 , 3, and 5 d later. 

For the study of neurite regeneration, PC12 cells were treated with NGF for 7-14 d NGF was then 
thoroughly washed away from the cells, and the cells were detached from the culture dish by trituration 
Cells were then replated in the presence or absence of NGF, and neurite outgrowth was scored 24 h later. 

Metabolic Radiolabeling of Proteins 

To assess endogenous protein methylation under various experimental conditions, PC12 cells were cultured 
in 1% horse serum for at least 16 h before metabolic radiolabeling for 6 h with 100 nCi/ml 
L-[methyl- H]methionine in the presence of 10 mM anisomycin at 35°C in a CO^ incubator. The 
methionine/cysteine content of RPMI-1640 was reduced by 80% during the labeling period. Growth factors 
Q^GF, EGF. or insulin) and/or 1 ^M DHCA were added for the indicated times. At the conclusion of the 
labeling penod. cells were washed three times with 1 ml of PBS (35°C), harvested in 200 ^1 SDS-PAGE 
lysis buffer, and held in a boiling water bath for 5 min. Total radiolabeled protein in each lysate was 
determined by TCA precipitation and liquid scintillation spectrometty. The averages of the determinations 
are given under Results as the means ± SEM. Aliquots of each cell lysate containing equal 
TCA-precipitable counts per minute were subjected to SDS-PAGE as described below. For analysis of 
protein methylation by two-dimensional (2D) lEF x SDS-PAGE, the cells were harvested in lEF lysis 
buffer (Aletta and Greene, 19870) without heating the sample. 
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. To assess changes in protein phosphorylation, PC 12 cell proteins were metabolically radiolabeled as 
previously described (Aletta, 1996H) in a Hepes-buffered Krebs-Ringer solution containing 0.1% 
D-glucose. When the effect of DHCA was examined, the drug was added to cultures 30 min before addition 
of the radioisotope. The labeling was carried out with 50 nCi/ml [^^pjorthophosphate at 35°C in room air 
for 2 h. At the end of the labeling period, cells were washed three times with 1 ml of PBS (35°C), harvested 
in SDS-PAGE lysis buffer, and held in a boiling water bath for 5 min. Total radiolabeled protein in each 
lysate was determined by TCA precipitation and liquid scintillation spectometry. 

In Vitro Protein Metbylation 

To measure protein methylation in subcellular fractions, cells were washed three times in PBS (4°C) and 

harvested by scraping with a rubber policeman in ice-cold homogenization buffer (100 mM Tris, pH 

8.0, 1 mM EDTA, 2 PMSF, and 10 ^ig/ml leupeptin). After homogenization at 4°C in a Dounce 

homogenizer, the homogenates were centrifuged at 4°C for 10 min at 500 g to obtain a crude nuclear 

fraction. The supernatant was centrifuged at 20,000 g for 5 min at 4°C. The resulting supernatant was 

centrifuged at 100,000 g for 90 min at 4°C to obtain the cytoplasmic fraction. The pellet of the 100,000 g 

spin was washed three times in homogenization buffer and resuspended in 150 \il of homogenization buffer 

to obtain the membrane fraction. Protein in each fraction was determined spectrophotometrically (Bradford, 

19703). Equal amounts of protein (125 ^g) from each fraction were incubated with 4.25 iiCi 

L-[methyl-3H]SAM in a total volume of 50 ^il for 1 h at 37°C. The labeling reaction was stopped by adding 

12.5 nl of 5x SDS-PAGE sample buffer (0.3 M Tris-HCl, pH 6.8, 45% glycerol, 1.4 M 2-mercaptoethanol, 

10% SDS, 0.001% bromophenol blue) and holding the samples in a boiling water bath for 5 min. In 

vitro-Iabeled proteins from the cell fractions were then analyzed by 7.5-15% gradient SDS-PAGE as 
described below. 



Gel Electrophoresis 

Discontinuous SDS-PAGE (Laemmli, 1970E) was performed with 19-cm separating gels composed of 
polyacrylamide gradients of 6-12, 7.5-15, or 8.5-15%, depending upon the experiment. Gels were fixed, 
stained with Coomassie blue, and then destained. Gels containing proteins labeled with 
[^2p]orthophosphate were dried and placed in contact with Kodak XAR film to produce an 
autoradiographic image. Gels containing tritium- labeled proteins were prepared for fluorography by 
washing the gel for 1 h in three changes of deionized water, followed by immersion in 1 M sodium 
salicylate for 30 min (Chamberiain, 197913). After drying, the gels were exposed to preflashed Kodak XAR 
film (Laskey and Mills, 1 9750) and stored at -70°C (Bonner and Laskey, 1 9743) with an intensifying 
screen. Quantitative comparisons of methyl-labeled proteins were obtained by scanning fluorograms into an 
analysis program (Molecular Analyst; Bio Rad Laboratories, Hercules, CA). 

For 2D lEF x SDS-PAGE, PC12 cell proteins were labeled as described above and harvested in a lysis 
buffer appropriate for lEF (Aletta and Greene, 198713). Equal TCA-precipitable counts per minute of cell 
lysates were subjected to ffiF with pH 5-7, and 3.5-10 ampholines at a ratio of 4:1, respectively. Proteins in 
lEF gels were fiirther resolved by SDS-PAGE (the second-dimension) using 12-cm separating gels 
composed of 7.5-15% polyacrylamide gradient. A standard lEF gel containing only lysis solution was used 
to determine the pH range. Fluorographic images of tritium-labeled proteins were generated as described 
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above. 

DNA Methylation 

Total methylation of cytosines in DNA was determined by first digesting PCI 2 cell DNA with Mspl, which 
cleaves methylated or unmethylated CCGG sequences (Bestor et al., 19843). The products are 5 labeled 
with [32p]ATP, and digested with nuclease PI . Methyl-cytosines are resolved from unmethylated cytosines 
by chromatography in isobutyric acid/water/ ammonium hydroxide (66:33:1) and detected by 
autoradiography (Bestor et al., 1984ED. Liquid scintillation spectrometry was used to quantify the results. 

Immunoprecipitation 

Tyrosine phosphorylation of Trk in the presence or absence of DHCA was assessed by 
immunoprecipitation followed by Western blotting. Cells were treated with 100 ng/ml NGF for 5 min, with 
or without 1 nM DHCA. Cells were then washed three times in ice-cold PBS and harvested in 1 ml of a 
lysis buffer containing 1% Triton X-100, 150 mM NaCl, 50 mM Tris-HCl, pH 8.0, 25 mM NaF, 5 mM 
EGTA, 5 mM EDTA, 2 PMSF, and 100 U/ml aprotinin. hisoluble material was removed by 
centrifugation at 4°C for 10 min at 13,000 g. Samples were precleared with 6 mg PAS for 2 h followed by 
centrifugation for 10 min at 13,000 g. Anti-phosphotyrosine monoclonal antibody (clone PT-66) was added 
to equal amounts of lysate protein for 2 h on a rotating platform at 4°C followed by addition of 3 mg of 
PAS and incubation on the rotating platform at 4°C for 1 h. PAS beads were then recovered by 
centrifugation at 1 3,000 g for 1 0 min, and washed three times with 1 ml of 1 % Triton X-1 00 lysis buffer 
followed by two 1-ml washes in lysis buffer without Triton X-100. The PAS beads were then resuspended 
in SDS-PAGE sample buffer and held in a boiling water bath for 5 min. The precipitated material was 
resolved by SDS-PAGE (7.5% acrylamide) followed by transfer to hnmobilon P membrane ( Millipore 
Cote,, Milford, MA). The blot was probed with 1 ng of rabbit polyclonal Trk antiserum (Santa Cruz 
Biotechnology, Santa Cruz, CA) for 2 h at room temperature followed by incubation with ^25i.donkey 
anti-rabbit IgG. The dried blot was then exposed to Kodak XAR film at -70°C with an intensifying screen. 

SAHH Activity 

PC12 cells were plated on 150-mm dishes (2 x lo^ cells) in the presence of 100 ng/ml NGF plus 10 nM to 
3 nM DHCA, or without added DHCA for 7 d. Cells were then washed three times in ice-cold PBS and 
harvested by scraping in a potassium phosphate buffer, pH 7.0 (25 mM KH2PO4, 25 mM K2HPO4, 1 mM 
dithiothreitol, 0.5 nM PMSF. and 10 ng/ml leupeptin). Cells were homogenized in a Dounce homogenizer, 
and the cell nuclei and debris were removed by centrifugation at 1 3,000 g Cytosol was then prepared by 
centrifugation at 100,000 g for 90 min and the protein concentration determined. SAHH activity was 
determined in the synthesis direction by a TLC method described previously by Hershfield (I979)(±l 
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Previous studies have implicated regulation of protein methylation in the early events of NGF-mediated 
signal transduction (Seeley et al., 1984Et Kujubu et al., 1993II1). To examine this possibility in greater 
detail, protein methylation patterns were assessed after metabolic radiolabeling of cellular proteins with 
L-[methyl-^H]methionine. Fluorograms of the labeled proteins generated from SDS-PAGE and 2D 
lEF X SDS-PAGE were used to detect specific changes in protein methylation after NGF treatment. The 
cellular pool of SAM, the predominant methyl donor in all cells, was radiolabeled using 
L-[methyl-3H]methionine in control PC 12 cells, and PC 12 cells treated with either 50 ng/ml NGF, 5 nM 
EGF or 1 \iM insulin. These concentrations and growth factors were chosen based on the biological effects 
produced by each in PC 12 cells. NGF at 50 ng/ ml produces the maximum neurite outgrowth response. 
EGF at 5 nM enhances cell proliferation (Huff et al., 19810) and 1 insulin is commonly used to effect 
cell survival in serum-free media (Rukenstein et al., 19910). We have verified that receptors for all three of 
these growth factors are present on the PC 12 cells used in these studies, by observation of the 
aforementioned biological effects at the doses indicated (data not shown). Radiolabeled methionine 
equilibrates with the cellular SAM pool within 20 min (Chelsky et al., 19850). A protein synthesis 
inhibitor, anisomycin (10 ^M), was present during the metabolic radiolabeling to prevent incorporation of 
L-[methyl-^H]methionine into newly synthesized proteins. 

In each of five independent trials, all three trophic factors produced a net increase in methyl group 
incorporation into PC 12 cell protein. After 6 h of treatment, EGF and insulin lead to -50% greater 
incorporation (±0.2, SEM) and NGF to a 31% increase (±0.1, SEM). Despite the slightly larger effects of 
EGF- and insulin-promoted increases in total methyl-^H-labeled protein, NGF treatment consistently yields 
protein methylation patterns exliibiting more pronounced changes in specific methylated proteins (Fig. i). 
Several, indistinctly resolved proteins that migrated between 24 and 20 kD showed increased methyl 
labeling relative to total protein methylation after 6 h NGF treatment (Fig. 1, top right). Among the 
methylated species detected in control cell lysates, were nine 68-60-kD proteins with isoelectric point (pi) 
values of 5.8-6.4 (Fig. i, A-G, *, and +). After treatment with 50 ng/ml NGF for 6 h, the proteins labeled A, 
*, and + in Fig. i showed little or no reproducible changes in methylation relative to controls. Although 
there was a net increase in total protein methylation, both increases and decreases in the methylations of 
specific proteins relative to total protein methylation were detected in response to NGF. Decreased labeling 
of proteins B, D, and E was detected with NGF (Fig. 1 b). Methyl labeling of proteins C, F, and G 
increased markedly, and methyl group incorporation into two proteins (N and N) was detected only after 
6 h in NGF-treated cells, but not in controls (Fig. i b). 

Fig. 1. NGF induces specific changes in the pattern of in vivo protein 
methylation observed by 2D lEF x SDS-PAGE. PC 12 cell proteins were 
metabolically labeled with L-[methyl-^H]methionine (100 nCi/ml) in the 
presence of anisomycin (10 nM) for 6 h. Whole cell lysates containing equal 
TCA-precipitable cpm (800,000) were separated by lEF. lEF gels were 
loaded onto 8.5- 15% gradient gels for the separation of methylated proteins 
by molecular mass. Fluorographic images generated from the control and 
6 h NGF (50 ng/ml) conditions are illustrated in the top two panels. 
Molecular mass standards (in kD) are indicated at the left of each panel. The 
boxed area in the top panels is enlarged below for control, 6 h NGF and 6 h 
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EG? (5 nM) samples. Fluorograms were generated by exposing x-ray film to the dried gels with an 
enhancing screen for 10 d at - 70°C. The pH gradient of the isoelectric focusing tubes is shown below the 
enlarged images (a-c). Labels (A, *, and +) indicate unaffected methylations, while the remaining letters 
and numbers indicate protein methylations affected byNGF. These results are representative of three 
independent experiments. 

[View Larger Version of this Image (7 IK GIF filet] 



Significant NGF-induced effects on protein methylation were also detected at pi 6.7-6.8 and 5.8-6.2 in the 
50-kD region (Fig. 1, a and b). Labeling of protein 3 was unaffected by NGF. Decreased methyl labeling of 
protein 1 was detected after 6 h NGF treatment, while proteins 2, 4, and 7 showed increased methyl 
labeling, and the [methyl- ^H] incorporation into protein 5 was only detected in response to NGF. The 
effects of NGF on protein methylation are unlikely to be due to incorporation of L-[methyl-^H]methionine 
into newly synthesized proteins because protein synthesis was inhibited by 97 ± 0.1% (« = 3) under the 
conditions of these experiments. In addition, no significant increases or decreases in the amounts of the 
specific proteins analyzed in Fig. i were detected by 2D lEF x SDS-PAGE of [^^SJmethionine-labeled 
proteins in whole cell lysates, in the absence of protein synthesis inhibition (data not shown). 

The NGF-induced changes in methyl labeling of specific proteins were not observed with the two other 
trophic factors tested. After EGF treatment for 6 h, the methylation pattern of the 68-60-kD proteins was 
quite different fi-om the NGF-induced changes relative to the control, nonstimulated cells (Fig. 1). The 
pattern of protein methylations with EGF was more similar to the pattern with control cells, but small 
increases in the methylation of proteins in this relative molecular weight range were reproducibly detected 
(B, C, D, and E). NGF, on the other hand, decreased the methylation of proteins B, D, and E. The changes 
in protein methylations were less marked with EGF than NGF and no methylated protein was detected with 
EGF that was not detected with controls. Moreover, no significant effects of EGF were detected in the 
50-kD set of methylated proteins. Insulin had no significant effects on the methylation of the 68-60- or 
50-kD proteins (data not shown). 

The effects of NGF on the pattern of protein methylation were also examined by in vitro labeling of 
proteins in subcellular fi-actions fi-om NGF-ti-eated cells as an independent approach for detecting the 
protein methylations that are affected by NGF. In vitro labeling provides a complementary method of 
analysis that does not require a protein synthesis inhibitor. It also indicates the possible cellular locations of 
protein methylation. PCI 2 cells were incubated without or with NGF for 15 min, 6 or 16 h. Nuclei (500 g), 
membranes (100,000 g, pellet), or cytosol (100,000 g, supernatant) were isolated and incubated with 
[methyl-^H]SAM for 1 h at 37°C, in vitro. Methylated proteins were resolved by 7.5-15% gradient gel 
SDS-PAGE and detected by fluorography. Changes in protein methylation were detected in nuclei, 
membranes, and cytosol (Fig. 2). Increases in protein methylation were detected after 15 min of NGF 
treatment (data not shown) and continued to increase fi-om 6 to 16 h (Fig. 2 A). Moreover, several of the 
proteins that showed increases in metiiylation have similar molecular weights as those detected by 2D 
lEF X SDS-PAGE (Fig. 1) after metabolic radiolabeling of intact cells (e.g., 68-64 kD, and 50 kD). Thus, 
changes in the methylation of proteins in response to NGF was confirmed by two independent approaches. 
In the nuclear fi-action, time-dependent increases (more than twofold) in protein methylation in response to 
NGF treatment were detected in proteins migrating at 97, 94, 67, and 64 kD. In the membrane fi-action, the 



7/23/01 3:51 PM 



J - umato et ai. us (i): lusy 



http://www.jcb.0rg/cgi/content/fuII/l 38/5/1 085 



metliylation of proteins migrating at 50 and 34 kD decreased after 6 or 16 h of NGF. Several proteins in the 
cytosolic fraction showed increased methylation. Most prominent among these were 114, 94, 50, and 
35 kD. Additional NGF-induced effects were observed by the in vitro method. This is most likely due to 
the higher specific activity of the [methyl-%]SAM pool in vitro than in the intact cell experiments and the 
enrichment of proteins by subcellular fractionation. In addition, the in vitro labeling experiments indicate 
that both the requisite methyltransferase and protein substrate were present at the time of analysis in each of 
the subcellular fractions that were isolated from NGF-activated cells. Moreover, irrespective of the specific 
mechanisms responsible for activation of the methylation of specific proteins by NGF, the state of 
activation was stable during preparation of the fractions. 



■■■i.^if' iH» 




Fig. 2. NGF-treatment of intact cells stimulates in vitro protein methylation in cell-free exfracts. (A) PC12 
cells were incubated in the presence or absence of NGF (50 ng/ml) for 6 or 16 h. Proteins in the nuclear 
(500 g, pellet), membrane (1 00,000 g, pellet), or cytosolic (100,000 g, supernatant) fractions were 
radiolabeled with L-[methyl-3H]SAM for 1 h at 37°C. The labeling reaction was quenched by the addition 
of 5 X SDS-PAGE sample buffer followed by holding the reaction tube in a boiling water bath for 5 min. 
Labeled proteins were separated by 7.5-15% gradient SDS-PAGE. The image shown is a fluorogram of the 
dried gel. Asterisks indicate methylated proteins not detectable in exfracts from non-NGF-freated confrol 
cells. Arrows identify increases of 67% or greater. The arrowheads (membrane fraction) point out decreases 
of >30%. These results were reproduced in an independent experiment. Specific changes at 16 h were as 
follows: nuclear proteins at 97 and 64 kD increase 2.4-fold; cytosolic proteins at 50 kD and 35 kD increase 
67 and 79%, respectively. (B) Cell proteins were metabolically radiolabeled followed by preparation of 
cytosol {in vivo) or the cytosol was prepared first, followed by incubation with [methyl-^HjSAM (in vitro). 
A portion of the 2D lEF x SDS-PAGE fluorograms from confrol (-NGF) versus 6 h NGF freatment 
i+NGF) are displayed. There are five individual protein spots migrating at 64 kD, which are most easily 
discerned in the in vivo +NGF condition. The 64-kD series of spots in each of the other fluorograms (four 
spots each) are superimposable with those of the in vivo +NGF condition. 
rView Larger Versions of these Images (103 + 40K GIF file)] 



To validate this approach further, cytosol was prepared from cells freated with or without NGF after 
metabolic radiolabeling, and replicate cultures without metabolic radiolabeling were also processed to 
obtain cytosol for in vifro protein methylation. Comparisons of the 2D lEF x SDS-PAGE fluorograms from 
each type of preparation indicate that several of the same proteins resolved in the 64 kD range (pi 5.8-6.4) 
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are similarly increased after NGF treatment by either method of analysis (Fig. 2 B). The in vitro data 
obtained after NGF treatment for 6 h confirm that changes in protein methylation triggered by NGF can 
occur whether or not protein synthesis has been inhibited. Thus, the in vivo and in vitro experiments 
independently indicate that NGF produces diverse, marked effects on the regulation of the methylation of 
several specific proteins. 

NGF Affects Protein Methylation during Eariy and Delayed Signaling 

The time dependence of NGF-induced changes in protein methylation was examined to determine the 
onsets and durations of changes in protein methylafion. PCI 2 cells were incubated with NGF for 15 min to 
24 h to examine protein methylation during early stages of NGF signaling and for 4 d to examine changes 
occurring concurrently with the appearance of neurites. Total protein methylation was assessed by the 
incorporation of [^HJmethyl groups into TCA-precipitable protein during NGF treatment for 24 h (Fig. 3). 
Significant increases in the total amounts of methyl group incorporation were detected within 15 min after 
the addition of NGF. The response over the early time course is biphasic with an initial rapid burst of 
methylation that decays between 1 and 4 h of NGF treatment followed by a second, persistent phase of 
elevated methylation firom 6 to 24 h. The pronounced decrease in methyl group incorporation between 
1 and 4 h indicates that the effect of NGF on total protein methylation is likely to be complex and 
dependent upon multiple factors, including methionine uptake, intracellular compartmentalization, and use 
of the labeled methyl group in other metabolic pathways. Thus, all comparisons of protein methylation 
p atterns examined by gel electrop horesis were standardized relative to total methyl-^H-labeled protein. 




Fig. 3. The time dependence of total methyl group incorporation into 
proteins in response to NGF. PC 12 cell proteins were metabolically 
labeled with L-[methyl-%]methionine (100 nCU ml) for 6 h in the 
presence of anisomycin (10 ^M). NGF (50 ng/ ml) was added to the cell 
cultures for the times indicated. Cells were lysed in SDS-PAGE sample 
buffer. Total methyl labeling of proteins (counts per minute) was 
measured for each condition after TCA precipitation of an aliquot of the sample. The relative amount of the 
H-labeled methyl groups in lysate proteins at each time point is given with respect to nontreated control 
cells. The data shown are the means ± SEM for five independent experiments. 
rView Larger Version of this Image (IQK GIF file)] 



Although the resolution of changes in protein methylation in whole cell lysates is reduced on one 

dimensional SDS-PAGE gels, this method was used to observe the time course of changes in the set of 

proteins migrating at 64-62 kD. Gradient SDS-PAGE and fluorographic analysis of methyl-^-labeled 

proteins in whole cell lysates indicated that there are increases in the methylation of the proteins in this 

region of the gel, detectable within 1 h after the addition of NGF and persisting for at least 8 h (Fig. 4 A). In 

addition, when using an 8.5-15% acrylamide gradient, a reproducible NGF-specific change in a 34-kD 

protein is evident at 3 and 6 h of NGF treatment (Fig. 4 B), but not at times earlier than 2 h (data not 
shown). 
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Fig. 4. Changes in the pattern of protein methylation are NGF specific and time dependent. (A) PC 12 cell 
proteins were metabolically labeled with L-[methyl-^H]methionine (100 |iCi/ml) for 6 h in the presence of 
anisomycin (10 ^M) with or without NGF (50 ng/ml) for the times indicated. Whole cell lysates containing 
equal TCA-precipitable cpm (300,000) were loaded in each lane. Proteins were separated by 7.5-15% 
gradient SDS-PAGE. The image shown is a fluorogram of the dried gel after exposure to x-ray film for 3 d 
at -70°C with an intensifying screen. The bracket indicates the migration position of a 64-62-kD complex 
of proteins that increase in a time-dependent manner. The arrowhead indicates the migration position of an 
invariant protein band, the density of which does not change by more than a few percent over the time 
course. By this measure, the 4-h lane is slightly overloaded, and the 6-h lane, underloaded. Nevertheless, 
the change in the 64-62-kD protein complex is still evident in the latter. The results illustrated here are 
representative of five independent experiments. (B) Metabolically radiolabeled methylated proteins 
(300,000 cpm) were obtained as described above and loaded on 8.5-15%polyacrylamide gradient gels to 
more favorably resolve the area between 29 and 36 kD. The image shown is a fluorogram derived from the 
dried gel after exposure to x-ray film for 2 d at -VO'^C. The arrow indicates the migration position of a 
regulated 34-kD protein. Cultures were treated with NGF (50 ng/ml), EGF (5 nM), or insulin (1 ^iM). The 
results shown are representative of three independent experiments. 
[View Larger Versions of these Images (92 + 47K GIF file)] 



To determine if any of the changes in methyl group incorporation persist for several days, the labeling 
studies were repeated in PC 12 cells treated with NGF for 4 d. This time point was chosen because --50% of 
PC 12 cells have neurites >20-^m long after 4 d of NGF (Greene et al., 1982EI). In addition, it was reasoned 
that some protein methylations may occur during neurite outgrowth that were not detected during early 
stages of NGF action. Thus, after 4 d of NGF, methylated proteins were analyzed by metabolic 
radiolabeling of cell proteins followed by 2D lEF x SDS-PAGE (Fig. 5) as described above. The pattern of 
[^Hjmethyl incorporation into PC 12 cell proteins relative to total methylated protein indicates that many of 
the changes induced by 6 h of NGF (Fig. I) persist for several days. Moreover, several of the NGF-induced 
increases and decreases in protein methylation observed after 6 h of NGF treatment were greater after 4 d 
of NGF treatment. Methylation of proteins between 68 and 60 kD (C, F, G, N, and N ) and in the 55-50 kD 
region (2-4, 6, and 7) increased relative to both the control (Fig. 5) and the 6 h NGF result (Fig. 1). Proteins 
C, F, G, N, and T>f appear to be the same proteins (based on pi and M^) that showed increases in 

methylation after 6 h of NGF treatment. Also, after 4 d of NGF treatment, the methylation of proteins B, D, 
and E, as well as protein 1 in the 50-kD region decreased relative to the control or 6-h NGF result. A 
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number of novel methylated proteins were also observed after 4 d of NGF treatment that were not detected 
after 6 h. These include proteins B , C in the 60-kD region of the gel, and proteins 7 , 7 , 8, and 
9, appearing in the 50-kD region of the gel. The distinctly different time courses for the methylations of 
specific proteins suggest that NGF may regulate the methylation of proteins involved in multiple signaling 
pathways. Moreover, most of the NGF- dependent changes in methylated proteins that were detected are 
not associated with transient phenomena. 

Fig. 5. NGF-induced changes in protein methylation persist and progress 
during prolonged NGF treatment. PC 12 cells were incubated with or 
without NGF (50 ng/ml) for 4 d. Cells were labeled with 
L-[methyl--^H]methionine (100 ^Ci/ml) in the presence of anisomycin 
(10 ^M) for 6 h. Equal TCA-precipitable cpm (750,000) of cell lysates 
were subjected to lEF. ffiF gels were loaded on 7.5-15% gradient 
SDS-PAGE gels to separate methylated proteins by molecular mass. The 
images shown are fluorograms of the dried gels after exposure of x-ray 
film for 12 d at -70*^C with an intensifying screen. The areas of the gels 
shown are similar to the boxed areas shown in Fig. i. The pH gradient of 
the lEF gels is indicated at the top of the figure. Labels are as in Fig. i. 
[View Larger Version of this hnage (90K GIF file)] 



Inhibition of Methylation Inhibits Neurite Outgrowth 

The marked effects of NGF on the methylation of specific proteins calls attenfion to the possibility that 
protein methylation is required for NGF signal transduction and neurite outgrowth. This possibility was 
examined by observing NGF-mediated neurite outgrowth after inhibiting methylation using two approaches 
based on the biochemical pathway outlined in Fig. 6. One approach was inhibition of SAHH by a 
mechanism-based, substrate analogue inhibitor, DHCA (Liu et al., 1992H). SAHH catalyzes the reversible 
hydrolysis of SAHcy to homocysteine and adenosine by an NAD-dependent mechanism (De la Haba and 
Cantoni, 1959H; Liu et al, 1992E). The reaction is driven in the hydrolysis direcfion in vivo by the 
adenosine deaminase-catalyzed conversion of adenosine to inosine. DHCA blocks the methylation pathway 
by inhibifing SAHH. Like the normal substrate, SAHcy, DHCA is oxidized by SAHH coupled to NAD"^ 
reduction, but the 3-keto DHCA formed cannot be hydrolyzed, thereby trapping SAHH in its inactive, 
NADH fomi (Liu et al., 1992H). SAHH inhibition results in an accumulafion of its substrate, SAHcy, a 
potent compefitive inhibitor of methyltransferases (Hildeshein et al., 1972l±(). 
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S-adenosylhomocysteine (SAH) 



Fig. 6. The methylation pathway. SAHH catalyzes the reversible 
hydrolysis of SAHcy. Inhibition of SAHH by DHCA or homocysteine 
leads to an accumulation of SAHcy and inhibition of methyltransferases. 



rView Larger Version of this Image (20K GIF file)! 



The concentration dependence for inhibition of neurite outgrowth by DHCA (Fig. 7 A) was determined by 

incubating PC 12 cells with NGF for 7 d with or without 10 nM to 3 nM DHCA. Neurite outgrowth was 

scored after 7 d of NGF treatment as described under Materials and Methods. The concentration of DHCA 

required to inhibit neurite outgrowth by 50% was -100 nM. A second approach that was used to test for a 

requirement of protein methylation in neurite outgrowth was inhibition of SAHH by addition of 

extracellular homocysteine, alone or in combination with DHCA. When intracellular levels of 

homocysteine rise, the SAHH-catalyzed reaction is driven in the synthesis direction (De la Haba and 

Cantoni, 19590; Hasobe et al., 19890), i.e., towards the formation of SAHcy (Fig. 6). Excess homocysteine 

acts cooperatively with inhibitors of SAHH in many other cell types (Chiang et al., 1977Et Kredich and 

Martin, 1977I±t Backlund et al., 1986t±t Hasobe et al., 19890), and the combination of the two is expected 

to elicit more inhibition of methylation than either alone. As shown in Fig. 7 5, 100 nM DHCA reduced 

NGF-induced neurite outgrowth to 59% of control (NGF alone), and 100 ^iM homocysteine reduced 

NGF-induced neurite outgrowth to 42%. The combination of 100 nM DHCA plus 100 nM homocysteine 

inhibited the neurite outgrowth by more than 80%. The findings that homocysteine alone inhibits neurite 

outgrowth, and that homocysteine enhances the inhibitory effect of a low dose of DHCA on neurite 

outgrowth indicate that the effect of DHCA on neurite outgrowth is most likely due to inhibition of 
methylation. 




Fig. 7. Concentration-dependent inhibition of neurite outgrowth by inhibitors of methylation. {A) Effect of 
DHCA. PC12 cells on 35-mm collagen-coated tissue culture dishes (300,000 cells per dish) were treated 
with NGF (50 ng/ml), or NGF and DHCA (10 nM to 3 ^M) for 7 d. Neurite-bearing cells were scored as 
described under Materials and Methods. Cultures treated with NGF and no DHCA received the DMSO 
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vehicle only, which had no effect on neurite outgrowth. The line was fit by least squares regression 
analysis. The data shown are the means ± SD for three independent experiments. (B) Effect of 
homocysteine alone and in combination with DHCA. All cultures were treated with NGF (50 ng/ml) and 
scored for the presence of neurites after 7 d. The data shown are the means ± SD for three independent 
experiments. The white bar represents the NGF control condition without DHCA or homocysteine. Results 
from cultures treated with 100 nM DHCA are represented by the adjacent striped bar (DHCA), and those 
from cultures treated with 100 nM homocysteine in the next striped bar (Hey), The cross-hatched bar 
indicates results from cultures treated with 100 nM DHCA plus 100 |iM homocysteine (DHCA + Hey). 
rview Larger Versions of these Images (8 + 18K GIF file)] 



Fig. 8 illustrates the effects of DHCA or DHCA plus homocysteine on cell morphology observed by phase 
contrast microscopy of NGF-treated PC12 cells. After 7 d of NGF-treatment, PC12 cells flatten, 
hypertrophy, and elaborate long neurites (Fig. 8 5). In the presence of 1 ^M DHCA and NGF for 7 d, the 
cells continue to respond to NGF by flattening, but neurite outgrowth is greatly attenuated (Fig. 8 Q. Short, 
spike-like processes are observed. In the presence of 100 nM DHCA plus 100 \iM homocysteine, the cells 
appear more flattened than control cells, but little neurite outgrowth is observed (Fig. 8 D), Thus, inhibition 
of methylation by DHCA, homocysteine, or the cooperative action of the two together suggest that 
methylation is an important requirement for NGF-induced neurite outgrowth. In addition, the 
morphological evidence (Fig. 8) indicates that not all NGF actions (e.g., the cell-flattening response) are 
inhibited by factors that inhibit methylation (see below). 




Fig. 8. The effects of DHCA and homocysteine on neurite outgrowth 
and cell morphology. PC 12 cells were plated on 35-mm collagen-coated 
tissue culture dishes at a density of 300,000 cells per dish. Cells were 
cultured for 7 d under control conditions without NGF (A), with 
50 ng/ml NGF (5), NGF plus 1 ^iM DHCA (Q, or NGF with 100 nM 
DHCA and 100 ^M homocysteine (Z)). Bar, 50 ^m. 
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DHCA-induced inhibition of S AHH was verified by direct measurement of SAHH activity. DHCA at 1 nM 
produced 70% inhibition and >90% inhibition was achieved at 10 nM. The consequence of this inhibition 
(elevated intracellular level of SAHcy) is expected to inhibit protein methylation (Fig. 6). Thus, the effect 
of DHCA on protein methylation was directly determined by assessing protein methylation in PC 12 cells 
using metabolic radiolabeling with L-[methyl-^H]methionine in the presence of anisomycin. The 
dose-dependent inhibition of protein methylation observed is similar to that of the DHCA effect on neurite 
outgrowth (Fig. 9). DHCA (1 pM) inhibits total methyl group incorporation measured by 
TCA-precipitation of equal aliquots of whole cell lysates as described under Materials and Methods. The 
total reduction in radiolabeled, methylated protein was 40% in control, non-NGF-treated cells, and 52% in 
cells stimulated with NGF for 6 h. Similar differences in the inhibitory effects (±NGF) were observed at all 
doses of DHCA above 30 nM (data not shown). The inhibitory action of DHCA is thus greater in 
NGF-treated cells. In conclusion, these experiments indicate that protein methylation is markedly inhibited 
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Fig. 9. Dose-dependent effect of DHCA on total protein methylation in 
j - ^^^^..."^^ r^^ PC 12 cells correlates with the DHCA effect on inhibition of neurite 

^ - ^^^'^^^^ outgrowth. Inhibition of NGF-stimulated neurite outgrowth (■) by 

r'"^'^ DHCA was compared to the inhibition of total protein methylation (o) at 

.1 ^ the same doses of DHCA. PC 12 cells were treated with DHCA (30 nM 

to 3 i^M) and NGF (50 ng/ ml) for 6 h in the presence of anisomycin to 
determine total protein methylation as described under Materials and Methods. Inhibition is plotted relative 
to control cultures treated with NGF only. The neurite outgrowth data are replotted from Fig. 7 A. 
rView Larger Version of this Image (8K GIF file)] 



In contrast, neither NGF alone nor DHCA with or without NGF have a significant effect on total 
methylation of cytosines in PC 12 cell DNA. The percentage of total cytosines methylated in NGF-treated 
cells (53.3 ± 1 .5) versus that in cells treated with NGF plus DHCA (48. 1 ± 0.2) was not statistically 
significant (w = 3, Student's paired / test; P > 0.05, two tails). All measurements were performed after 24-h 
treatments. 

DHCA Treatment Does Not Interfere with Cell Proliferation or Early NGF-mediated Signaling 
Events 

Previous efforts to examine the possible role of methylation in NGF signal transduction (Seely et al., 1984; 

Haklai et al., 1993Et Kujubu et al., 1993111) used methylation inhibitors at relatively high concentrations 

that can produce cytotoxicity (Seeley et al., 19840) and inhibition of receptor tyrosine kinases (Meakin and 

Shooter, 1991Et Maher 1993ED. The methylation inhibitor used in this study, DHCA, is much less toxic 

than the related adenosine analogues used previously and is effective at concentrations four orders of 

magnitude lower than those used previously (Seeley et al., 1984t±t Meakin and Shooter, 1991[±t Haklai et 

al., 1993S; Kujubu et al., 1993t±t Maher, 1993(11). DHCA does not affect the growth rate of PC12 cells in 

serum-containing medium (Fig. \0A). In the absence of serum, PC 12 cells degenerate via programmed cell 

death. DHCA has no effect on the rate that cell death is induced (Fig. 10.4). In the absence of serum, NGF 

rescues PC12 cells from programmed cell death (Rukenstein et al., 1991[±I). Also shown in Fig. 10^4, 

DHCA had no effect on the protection afforded by NGF. In addition to confirming the absence of 

cytotoxicity, these findings suggest that DHCA does not interfere with the ability of NGF to activate its 

tyrosine kinase receptor (Trk), which prevents programmed cell death (Kaplan and Stephens, 19940). Thus 

DHCA has no detectable effects on the grov^^h or survival of PC 12 cells under any of the conditions 
examined. 
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Fig. 10. DHCA treatment does not interfere with cell growth, NGF-mediated survival, or rapid onset 
NGF-stimulated phosphorylations. {A) PC 12 cells were plated on 24-well collagen-coated dishes at a 
density of 100,000 cells per well. Cells were allowed to attach to the dishes overnight, and then were 
washed with RPMI-1640 medium to remove serum. Cells were then cultured in complete 
(serum-containing) medium (CM, o and •), plus or minus 1 DHCA; RPMI-1640 without serum plus 
50 ng/ml NGF, plus or minus 1 nM DHCA {NGF, □ and ■); RPMI-1640 medium without serum or NGF, 
plus or minus 1 nM DHCA (No serum, A and A). Similar results were observed in two other independent 
experiments. (B) Effect on Trk phosphorylation. PC 12 cells were treated with or without NGF (50 ng/ml) 
for 5 min. Where indicated, 1 ^M DHCA was added to cultures for 1 h before treatment with NGF. Cell 
lysates were equalized for total protein, immunoprecipitated with anti-phosphotyrosine antibody (as 
described under Materials and Methods), resolved by 7.5% SDS-PAGE and transferred to Immobilon P 
membranes. The blot was probed with anti-Trk antibody, followed by donkey anti- rabbit '^^I-IgG. The 
image shown is an autoradiogram of the Western blot after a 5-d exposure at -80°C. This result was 
verified in a second independent experiment. (Q PC12 cells were treated with or without NGF (50 ng/ml) 
for 2 h. DHCA (1 ^M) was added 1 h before the addition of NGF where indicated. Cells were radiolabeled 
with [ PJorthophosphate for 2 h as described under Materials and Methods. For long term NGF-induced 
phosphorylations, PC 12 cells were grown in the presence of NGF (50 ng/ml) plus or minus 1 nM DHCA 
for 10 d. Cells were then radiolabeled with [^^PJorthophosphate for 2 h. For both short and long term NGF 
treatment experiments, cell lysates were equalized for total TCA-precipitable ^^P-labeled protein and 
separated on 6-12% gradient SDS-PAGE. The image shown is an autoradiogram of the dried gel. Exposure 
time was 16 h. The arrows at tlie right indicate, in decreasing the positions of the previously identified 

phosphoproteins, 64-kD chartin, 60-kD tyrosine hydroxylase, and 55-kD P-tubulin. 
rView Larger Versions of these Images f 12 + 62 + 40K GIF file)] 



Since other adenosine-analogues, which are putative methylation inhibitors, are known to inhibit 
phosphorylation and activation of Trk (Meakin and Shooter, 19910; Maher 1993E1), the effect of DHCA on 
the tyrosine phosphorylation of Trk was also determined. PC 12 cells were incubated with or without 1 nM 
DHCA for 1 h, and NGF was then added for 5 min. Immunoprecipitation with a phosphotyrosine antibody 
was followed by SDS-PAGE and immunoblotting with anti-Trk antibody. DHCA had no detectable effect 
on the Trk tyrosine phosphorylation mediated by NGF (Fig. 10 B), the primary event in NGF signaling. 

To determine if DHCA affects other phosphorylation events downstream of Trk activation, total 
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phosphoproteins were analyzed by SDS-PAGE. Several NGF-induced increases in phosphorylation of 
proteins are resolved by this method (Fig. 10 Q. The rapid increase in the phosphorylation of tyrosine 
hydroxylase (Halegoua and Patrick, 1980S) that occurs after NGF treatment is not affected by DHCA. The 
overall phosphorylation pattern was similarly unaffected by DHCA. Two other NGF-induced increases in 
phosphorylation, which require long-term (7 d) NGF treatment, are temporally associated with neurite 
outgrowth. Unlike phosphorylation of tyrosine hydroxylase, the increased ^^P incorporation into the 64-kD 
chartin (Aletta and Greene, 1987E) and P-tubulin (Aletta, 1996(±t Black et al., 19860) is diminished by 
exposure of the cells to DHCA. The inhibitory effect of DHCA on the phosphorylations of these two 
proteins is the predicted consequence of DHCA inhibition of neurite outgrowth. Thus, DHCA affects 
delayed phosphorylation events in NGF signaling but does not inhibit phosphorylations associated with the 
early signaling events studied here. 

DHCA Effects on Neurite Outgrowth Are Diminished by prior NGF Treatment and Are Rapidly 
Reversible 

To explore further the functional significance of the time dependence of protein methylation during 
NGF-mediated neurite outgrowth, DHCA (1 ^M) was added to PC 12 cells at the same time that NGF was 
added or 1-6 d after NGF addition. Under these conditions, the later DHCA was added after NGF, the less 
its inhibitory effect (Fig. ii A), The decrease in inhibition showed a linear dependence on the time at which 
DHCA was added to the cells. Once the NGF-signaling cascade begins, neurite outgrowth is progressively 
less sensitive to inhibition of methylation. This would occur if the rates of demethylation were low for 
those methylated proteins that are required for neurite outgrowth. Alternatively, neurite outgrowth may be 
insensitive to inhibiting methylation once neurite outgrowth proceeds past a critical, committed step. While 
initiation of neurite outgrowth from PC 12 cells after exposure to NGF is slow (requiring 7 d to achieve 
-100% neurite bearing cells), PC 12 cells can rapidly regenerate after preformed neurites are disrupted as 
described under Materials and Methods. Fig. IT 5 shows the concentration dependence of inhibition of 
neurite regeneration by DHCA. Neurite regeneration was inhibited 40% by 3 ^M DHCA. In contrast, if 3 
^M DHCA is used when NGF is first added to the cells, it inhibits neurite outgrowth by 84% (Fig. 7 A). As 
described above, the difference in the sensitivities of regeneration and de novo neurite outgrowth to 
inhibition by DHCA is consistent with slow rates of demethylation or decreased sensitivity to DHCA once 
differentiation has reached a critical stage. 



Fig. 11. The effect of prior NGF treatment on the neurite inhibiting action of DHCA. (A) The effect of 
varying the time interval between NGF treatment and addition of DHCA. PC 12 cells were treated with 
NGF (50 ng/ml) for 7 d. DHCA (1 nM) was added to the cells either at the time NGF was initially added to 
the cell cultures (time zero), or after a time interval of 1, 2, 3, 4, 5, or 6 d. Cells were scored for the 
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presence of neurites after 7 d of NGF. The data shown are the means ± SD for three independent 
experiments. {B) The effect of inhibition of methylation on neurite regeneration. PC 12 cells were grown in 
NGF (50 ng/ml) for 7-14 d. NGF was removed from the cultures by an extensive washing procedure 
described under Materials and Methods. Cells were removed from the culture dishes by trituration and 
replated without or with DHCA (3 nM to 3 ^iM). NGF was added to all cultures 1 h after plating. Neurite 
regeneration was scored 16-24 h later. The line was fit by least squares regression analysis. The data shown 
are the means ± SD for three independent experiments. 
[View Larger Versions of these Images (10 + 8K GIF file)] 



The onset of neurite outgrowth after DHCA is removed was examined to compare the rate of reversing the 
effect of DHCA on neurite outgrowth to the rate of NGF- induced neurite outgrowth when DHCA had 
never been present. In addition, removal of DHCA was used to detect protein methylations that are 
associated with neurite outgrowth during the reversal period. Reversibility of the effects of DHCA on 
neurite outgrowth and protein methylation was assessed by incubating PC 12 cells with NGF and DHCA for 
10 d, followed by removal of the DHCA. Interestingly, the inhibitory effect of DHCA on neurite outgrowth 
was rapidly reversed (Fig. 12 A). No lag was observed before an increase in neurite outgrowth was 
detected. The for reversal was 12 h (Fig. 12 The rate of neurite outgrowth when DHCA is removed 
is, therefore, about eightfold faster than the normal for neurite outgrowth when NGF is added to naive 
cells. Thus, the NGF-dependent signaling events that prime the cells (Burstein and Greene, 197a±t Greene 
et al., 1 9820) for neurite outgrowth were not inhibited greatly by the methylation inhibitor, DHCA. 




Fig. 12. Reversal of DHCA-induced inhibition of methyltransferases results in rapid neurite outgrowth and 
a time-dependent increase in protein methylation. (A) PC12 cells were plated on 35-mm collagen-coated 
tissue culture dishes at a density of 300,000 cells per dish in NGF (50 ng/ ml) plus or minus 1 \iM DHCA 
for 7 d. The medium was then changed repeatedly to remove DHCA, and neurite outgrowth was scored and 
plotted over a 3-d period. Controls were maintained in DHCA/NGF (A) and NGF alone (•). The 
experimental condition was treatment with DHCA and NGF for 7 d, followed by NGF alone for 3 d (■). 
The data are normalized to 100% relative to the number of neurite bearing cells present in cultures treated 
with NGF alone. The data shown are the means ± SD for three independent experiments. The line shown 
for the reversal data is the best fit to a single exponential fimction with a tj/2 of 12 h. After DHCA removal, 
the protein methylation pattern was also assessed by SDS-PAGE. (B) Cells were harvested after 10 d of 



7/23/01 3:51 PM 



:J - Umato et al. iispj: iu»y 



http://wwwjcb.org/cgi/content/tull/ 138/:)/ 1 081 



DHCA/NGF treatment or after 7 d DHCA/NGF treatment followed by progressively longer times (-h) in 
the absence of DHCA. Cell lysates were equalized for total TCA-precipitable counts per minute and 
separated on 7.5-15% gradient SDS-PAGE. The figure shown is a fluorogram of the dried gel. Exposure 
time was 5 d. The arrows and brackets indicate the positions of proteins that increased in methylation by 
25% or more relative to the 10 d DHCA lane. Asterisks mark the position of protein bands not detected in 
the 10-d DHCA lane. The changes noted during the reversal period were replicated in four independent 
experiments. 

["View Larger Versions of these hnages (13 + 76K GIF file)] 



Upon removal of DHCA, the rapid onset of neurite outgrowth was paralleled by rapid reversal of its 
inhibitory effects on the methylation of several proteins. Proteins that increase by 25% or more in 
methylation upon removal of DHCA are indicated by the arrows, brackets, and asterisks shown in Fig. J_2 
B, The molecular weights of proteins that showed increased methylation migrate at 150, 120, 94, 70/68, 
50, 48/47, 40, and 35 kD. The time dependencies for methylation were different for different proteins. 
These results, together with the time course (Figs. 4 and 5), inhibitor studies (Figs. 7-10 ). and specificity of 
the NGF- induced effects on protein methylation (Fig. 1) indicate that regulation of protein methylation is 
required for neurite outgrowth. 



Discussion ** 

The Subset of Methylated Proteins that Are NGF-reguIated 

This work extends previous studies (Seeley et al., 1984[±t Kujubu et al, 1993E) of the role of methylation 
in growth factor signaling mechanisms and further demonstrates the feasibility of exploring this role by 
metabolic radiolabeling of specific proteins in intact cells and by in vitro labeling of proteins in cell 
fi'actions. The results indicate that protein methylation is involved in the NGF-induced signal transduction 
that leads to neurite outgrowth. Methylation of numerous proteins was detected by metabolic radiolabeling 
indicating that protein methylation is an ongoing, active process. NGF specifically affects the methylation 
of a subset of these proteins. Further evidence for a significant role for protein methylation in neurite 
outgrowth is the marked concurrent inhibitory effects of DHCA on protein methylation and neurite 
outgrowth. Moreover, removal of DHCA rapidly leads to concomitant increases in protein methylation and 
neurite outgrowth. The identities and functions of the NGF-regulated proteins remain to be determined. It is 
likely that some of the proteins identified by this approach represent previously unidentified players in NGF 
signal transduction. This is significant because specific protein modifications that are temporally coincident 
with neuronal differentiation may be useful in the elucidation of the mechanism of action of NGF (Chao, 
1992H; Kaplan and Stephens, 19940). Thus, signal transduction pathways may be regulated in part by 
regulating the methylation of specific proteins in much the same manner as protein phosphorylation 
regulates signaling pathways. 

Potential Actions of Protein Methylation During NGF Signal Transduction 

The data presented here focus on NGF actions on protein methylations rather than methylation of 
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phospholipids, RNA or DNA. Methylation of genes serves as a general signal for repression of 
transcription (Razin and Cedar, 199 IS). Hypomethylation of the genome is associated with cellular 
differentiation in vitro (Bestor et al., 1984[±I). There were no significant effects of NGF and/or DHCA on 
total DNA cytosine methylation in PC 12 cells. Thus, the potent inhibitory action of DHCA on neurite 
outgrowth is unlikely to be due to an effect on DNA methylation. Previous work also indicates that NGF 
does not detectably affect phospholipid methylation (Ferrari and Greene, 1985S). These authors further 
demonstrated that complete inhibition of phospholipid methylation is achieved at concentrations of 
deaza-adenosine and homocysteine thiolactone that do not block neurite outgrowth. 

There are several potential mechanisms through which NGF-induced changes in protein methylation may 
affect biological responses. Gene regulation may be affected, particularly if the modified proteins are 
involved in gene expression. For example, many heterogeneous nuclear RNPs that associate with 
pre-mRNAs are methylated on arginine residues (Liu and Dreyfuss, 1995B!). Modulation of RNA splicing 
has been proposed as one potential target for protein methylation in signal transduction (Lin et al., 19960). 
Consistent with this view, a protein arginine methyltransferase activity appears to be constitutively 
associated with the type I interferon receptor (Abramovich et al., 19970). A role for protein methylation in 
cell signaling is further substantiated by the finding that methylation of the Y subunit of a heterotrimeric 
G-protein enhances activation of Pl-specific phospholipase C and PI3-kinase in vitro (Parish et al., 19950). 
If this biochemical effect occurs in intact cells, protein methylation could play an important general role in 
the amplification of intracellular signals. Protein methylation may also be involved in altering the 
subcellular localization of specific proteins, as recently suggested for the cell cycle-dependent methylation 
of protein phosphatase 2A (Turowski et al., 19950). 

Neurite outgrowth requires a fiindamental reorganization of the cytoskeleton including formation of 
intermediate filaments and parallel bundles of microtubules oriented longitudinally in neurites, as well as 
specialized actin microfilament arrays within motile growth cones. Inhibition of neurite outgrowth by 
DHCA maybe related to inhibition of protein methylation, which affects the cytoskeleton. For example, 
Ras and Rho proteins, modified by isoprenylation and methylation, are involved in redistribution of actin 
microfilaments during changes in cell shape (Vojtek and Cooper, 19950). Recently, Luo et al. (1996)0 
presented evidence that constitutively active Racl in transgenic mice interferes with the normal formation 
of Purkinje cell axons and dendrites. Since only the methylation step of small G-protein processing is 
reversible (Rando, 19960), methylation is more likely than other processing steps to be subject to 
physiological regulation. NGF-induced increased methyl labeling of G-proteins in PC12 cells (Haklai et al., 
1993Et Kujubu et al., 19930) and the specific proteins observed in this study indicate that NGF may serve 
as one mediator of regulatory control in responsive neurons. Furthermore, Klein and colleagues have raised 
the possibility that protein methylation plays an important role in neurulation (Coelho and Klein, 19903 
Moephuli et al., 19970) and neurite outgrowth in rat embryo cultures (Coelho and Klein, 19900). The 
present work lends additional support to this possibility, especially with regard to neurite outgrowth. 

DHCA Is a Potent and Nontoxic Inhibitor of Protein Methylation 

DHCA is a promising tool for investigating the possible roles of methylation in signal transduction 
mechanisms. This inhibitor was developed by Liu and colleagues to obtain a more specific, less toxic 
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inhibitor of SAHH than previously used inhibitors (Liu et al., 1992S). The 50% effective dosage (ED^q) for 

the effect of DHCA on neurite outgrowth for PC12 cells (-100 nM) is four orders of magnitude lower than 
previously used inhibitors (Seeley et al., 1984111). Inhibition of neurite outgrowth by >-S5% is difficult to 
achieve, even at the highest concentration of the inhibitor. Pretreatment of cells with DHCA for several 
weeks before NGF stimulation, however, does produce nearly complete inhibition even at 7 d of NGF 
(Cimato, T.R., M.J. Ettinger, and J.M. Aletta, unpublished observations). These observations provisionally 
indicate that basal, ongoing protein methylation may serve an auxiliary role in neurite outgrov^h in addition 
to the NGF effects on protein methylation. As expected, DHCA potently inhibits SAHH activity (Hasobe et 
al., 19890) and consequently leads to decreased total protein methylation. The dose-dependent inhibition of 
protein methylation observed is quite similar to that for inhibition of neurite outgrowth (Fig. 9). The 
cooperative effects of homocysteine and DHCA on neurite outgrowth further substantiate a specific effect 
on methylation. The finding that total protein methylation was inhibited more when NGF was present than 
when absent, suggests that under the influence of NGF, many of the potential methyltransferases involved 
in the NGF response may be more sensitive to inhibition due to increased levels of SAHcy. This may occur 
because, as a result of increased total protein methylation (Fig. 3), intracellular levels of SAHcy are 
expected to be elevated even before DHCA treatment. NGF has no measureable effect on SAHH activity 
(Cimato, T.R., M.J. Ettinger, and J.M. Aletta, unpublished resuhs). 

DHCA is unique among methylation inhibitors because of its lack of effect on cell flattening, 
phosphorylation of tyrosine hydroxylase, and tyrosine phosphorylation of Trk. Less specific methylation 
inhibitors, used to study the role of methylation in PC12 cells previously (Seeley et al., 1986; Kujubu et al., 
1993H; Maher, 19930; Meakin and Shooter, 1995), blocked each of these actions. The most remarkable 
property of DHCA in comparison to previously used inhibitors is that DHCA inhibits neurite outgrowth 
without inhibiting the NGF priming that prepares PC 12 cells for neurite outgrowth (Greene et al., 19820). 
This is best illustrated by the rapid reversal of DHCA effects on neurite outgrowth, which is in marked 
contrast to that observed with other methylation inhibitors. Reversal of inhibition by 5 -deoxy-S-methyl 
adenosine requires the same 7 d as required by control PC 12 cells when first exposed to NGF (Seeley et al., 
1986). Thus, DHCA is clearly more specific than previously used methylation inhibitors. DHCA 
specifically affects neurite outgrowth without affecting many other signaling effects of NGF, including 
those associated with priming. 

Protein Methylation in Early and Delayed NGF Signaling 

Grov^h factors in general, promote biological responses that are either extremely rapid (within minutes) or 
require many hours or days to develop fully. The diversity of signaling networks (Pawson, 1985I±t Hunter, 
1995E) and the propensity of specific pathways to transmit and/or sustain a given signal (Marshall, 1995H) 
may help to explain the different temporal characteristics of responses. The results of this study and other 
recent work (Metz et aL, 1993(3 Philips et al., 1993E, 1995S) indicate that posttranslational modification 
of proteins by methylation offers another potential mechanism for regulating both early and delayed growth 
factor signaling. 

The importance of early and delayed responses to NGF in PC 12 cells, dorsal root ganglion, and 
sympathetic neurons is well documented (Greene, 1984(3). One of the most obvious examples of a delayed 
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NGF response is neurite outgrowth. Little neurite outgrowth is evident in PC 12 cells treated with NGF for 
12-24 h. 50% of the cells extend neurites within 4 d, and nearly all have neurites by 7 d. NGF-mediated 
neurite outgrowth requires transcription (Burstein and Greene, 1978l±t Greene et al., 1982[±!). The long 
latency is apparently due to the progressive accumulation of specific proteins required for neurite assembly 
during the priming period. This accumulation may depend on protein synthesis (Burstein and Greene, 1978 
H) and/or progressive posttranslational modifications of specific proteins (e.g., as shown here for the 
methylation of specific proteins). Since regeneration of neurites from PC12 cells previously treated with 
NGF occurs much more rapidly (within 24 h) than initiation (7 d); the longer time required for neurite 
outgrowth in the latter is thought to be due to the "priming" events. 

When interpreted in light of the priming model of NGF-induced neurite outgrowth, the experiments that 
used DHCA provide important insights regarding the temporal nature of protein methylation involved in 
this NGF response. Inhibition of protein methylation by DHCA is most effective in reducing neurite 
formation when the DHCA is present concomitantly with NGF treatment. There is decreased inhibition of 
neurite outgrowth as the time interval between NGF treatment and the addition of DHCA increases (Fig. U 
A). DHCA is also less effective at inhibiting neurite regeneration (Fig. U. B). These results imply 
progressive accumulation of the methylated protein product(s) necessary for neurite outgrowth, as was 
detected at 4 d of NGF treatment (Fig. 5). Slow turnover of methylated proteins is consistent with the 
reduced effect of DHCA when added after NGF signaling has been initiated. 

The time dependencies of specific protein methylations, the effects of DHCA on protein methylation, and 
the lack of effects of DHCA on early protein phosphorylations and priming are consistent with little or no 
interdependence between protein methylation signaling events and early, priming events. Moreover, the 
data presented on the phosphorylation of Trk and other proteins that lie downstream of Trk activation (Fig. 
10, 5 and C) further substantiate the view that some cellular signaling events are more susceptible to 
interference from inhibition of protein methylation than others. Autophosphorylation of Trk and the rapid 
increase in the phosphorylation of tyrosine hydroxylase (Halegoua and Patrick, 19803) are not affected by 
DHCA. Two other NGF-induced increases in phosphorylation, which require long term ^7 d) NGF 
treatment for maximal phosphorylation, are temporally associated with neurite outgrowth. In accord with 
DHCA-mediated inhibition of neurite outgrowth, but unlike the rapid, early phosphorylation events, the 
increased ^^p incorporation into the 64.kD chartin (Aletta and Greene, 1987H) and P-tubulin (Black et a!., 
198Q±t Aletta, 1996H) is diminished by exposure of the cells to DHCA. 

It can thus be concluded that DHCA does not disrupt all early NGF signaling events and priming actions, 

nor does it abrogate many of the biological effects of NGF signaling (e.g.. Fig. 10 A, survival; Fig. 9, cell 

flattening). The effects of DHCA on neurite outgrowth and inhibition of the phosphorylation of 

neurite-associated proteins (P-tubulin and chartin) are consistent with a selective inhibitory action on an 

NGF pathway responsible for neurite outgrowth. Based on all of the above results, the protein methylation 

required is likely to occur downstream from, or parallel to, rapid NGF actions and priming events. This 

interpretation does not exclude the possibility that protein methylation of rapid onset also plays an 

important role in neurite outgrowth. Once priming is achieved, NGF stimulation of some of the rapid 

protein methylations observed in this study may assume added significance, particularly if they occur at the 
growth cone. 
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tyrosine kinase receptor. 



References ^ 

1. Abramovich, C, B. Yakobson, J. Chebath, and M. Revel. 1997. A protein-arginine methyltransferase 
binds to the intracytoplasmic domain of the IFNARl chain in the type I interferon receptor. EMBO 
(Eur. Mol. Biol Organ.) J. 16: 260-266 [Abstract/Fuil Text1 . 

2. Aletta, J.M.. 1996. Phosphorylation of type m P-tubulin in PC12 cell neurites during NGF-induced 
process outgrowth. J. Neurobiol 31: 461-475 [Medline] . 

3. Aletta, J.M., and L.A. Greene. 1987. Sequential phosphorylation of chartin microtubuie-associated 
proteins is regulated by the presence of microtubules. J. Cell Biol 105: 277-290 [Abstract] . 

4. Backlund, P.S. Jr., D. Carotti, and G.L. Cantoni. 1986. Effects of the S-adenosylhomocysteine 
hydrolase inhibitors 3-deazaadenosine and 3-deazaaristeromycin on RNA methylation and synthesis. 
Eur. J. Biochem. 160: 245-251 [Abstract] . 

5. Bestor, T.H., S.B. Hellewell, and V.M. Ingram. 1984. Differentiation of two mouse cell lines is 
associated with hypomethylation of their genomes. Mol Cell Biol 4: 1800-1806 [Medlinel . 

6. Black, M.M., J.M. Aletta, Jind L.A. Greene. 1986. Regulation of microtubule composition and 
stability during nerve growth factor-promoted neurite outgrowth. /. Cell Biol 103: 545-557 
[Abstract! . 

7. Bonner, W.M., and R.A. Laskey. 1974. A film detection method for tritium- labeled proteins and 
nucleic acids in polyacrylamide gels. Eur. J. Biochem. 46: 83-88 [Medline] . 

8. Bradford, M.M.. 1976. A rapid and sensitive method for the quantitation of microgram quantities of 
protein utilizing the principle of protein-dye binding. Anal Biochem. 72: 248-254 [Medlinel . 

9. Burstein, D.E., and L.A. Greene. 1978. Evidence for RNA synthesis-dependent and independent 



f27 



7/23/01 3:51 PM 



J~*;imatoetal. 138 p): lU«y 



http://\vwwjcb.org/cgi/content/hjll/ 138/5/ 1 Olii? 



pathways in stimulation of neurite outgrowth by nerve growth factor. Proc. Natl Acad. Sci. USA. 75: 
6059-6063 fMedlinel 

10. Chamberlain, J.P.. 1979. Fluorographic detection of radioactivity in polyacrylamide gels with the 
water-soluble fluor, sodium salicylate. Anal. Biochem. 98: 132-135 [Medlinel . 

11. Chao, M.V.. 1992. Growth factor signaling: where is the specificity? Cell. 68: 995-997 fMedlinel . 

12. Chelsky, D., B. Ruskin, and D.E. Koshland. 1985. Methyl-esterified proteins in a mammalian cell 
line. Biochemistry. 24: 6651-6658 FMedlinel . 

13. Chiang, P.K., H.H. Richards, and G.L. Cantoni. 1977, S-adenosyl-L-homocysteine hydrolase: 
analogues of S-adenosyl-L-homocysteine as potential inhibitors. Mol. Pharmacol. 13: 939-947 
[Abstract] . 

14. Coelho, C.N.D., and N.W. Klein. 1990. Methionine and neural tube closure in cultured rat embryos: 
morphological and biochemical analyses. Teratology. 42: 437-451 [Medhnel . 

15. De la Haba, G., and G.L. Cantoni. 1959. The enzymatic synthesis of S-adenosyl-L-homocysteine from 
adenosine and homocysteine. J. Biol. Chem. 234: 603-608 . 

16. Favre, B., S. Zolnierowicz, P. Turowski, and B.A. Hemmings. 1994. The catalytic subunit of protein 
phosphatase 2 A is carboxyl-methylated in vivo. J. Biol. Chem. 269: 1631 1-16317 [Abstract] . 

17. Ferrari, G., and L.A. Greene. 1985. Does phospholipid methylation play a role in the primary 
mechanism of action of nerve grov^h factor? /. Neurochem. 45: 853-859 [Abstract] . 

18. Greene, L.A.. 1984. The importance of both early and delayed responses in the biological actions of 
nerve growth factor. Trends NeuroscL 13: 91-94 . 

19. Greene, L.A., and A.S. Tischler. 1976. Establishment of a noradrenergic clonal hne of rat adrenal 
pheochromocytoma cells which respond to nerve grov/th factor. Proc, Natl. Acad. Sci, USA. 73: 
2424-2428 [Medline] . 

20. Greene, L.A, D.E. Burstein, and M.M. Black. 1982. The role of transcription-dependent priming in 
nerve growth factor promoted neurite outgrovrth. Dev. Biol. 91: 305-316 [Medline] . 

21. Greene, L.A., J.M. Aletta, A. Rukenstein, and S.H. Green. 1987. PC12 pheochromocytoma cells: 
culture, nerve growth factor treatment, and experimental exploitation. Methods Enzymol 147: 
207-216 [Medline] . 

22. Haklai, R., S. Lemer, and Y. Kloog. 1993. Nerve growAth factor induces a succession of increases in 
isoprenylated methylated small GTP-binding proteins of PC12-pheochromocytoma cells. 
Neuropeptides. 24: 11-25 [Medline] , 

23. Halegoua, S., and J. Patrick. 1980. Nerve growth factor mediates phosphorylation of specific proteins. 
Cell. 22: 571-581 [Medline] . 

24. Hasobe, M., J.G. McKee, H. Ishii, M. Cools, R.T. Borchardt, and E. DeClercq. 1989. Elucidation of 
the mechanism by which homocysteine potentiates the anti-vaccinia virus effects of the 
S-adenosylhomocysteine hydrolase inhibitor 9-(trans-2', trans-3'-dihydroxycyclopent-4'-enyl) 
derivatives of adenine and 3-deazaadenine on the metaboHsm of S-adenosylhomocysteine in mouse 
L929 cells. Mol Pharmacol 36: 490-496 [Abstract] . 

25. Hershfield, M.S.. 1979. Apparent suicide inactivation of human lymphoblast S-adenosylhomocysteine 
hydrolase by 2'-deoxyadenosine and adenosine arabinoside. J. Biol Chem. 254: 22-25 [Medline] . 

26. Hildeshein, J., R. Hildeshein, and E. Lederer. 1972. New syntheses of S-adenosyl homocysteine 
analogues, potential methyltransferase inhibitors. Biochimie (Paris). 54: 989-995 [Medline] , 

27. Hrycyna, C.A., and S. Clarke. 1993. Modification of eukaryotic signaling proteins by COOH-terminal 
methylation reactions. Pharmacol Ther. 59: 281-300 FMedline] . 

28. Huff, K., D. End, and G. Guroff. 1981. Nerve growth factor-induced alteration in the response of 
PC12 pheochromocytoma cells to epidermal growth factor. /. Cell Biol 88: 189-198 [Medline] . 



f27 



7/23/01 3:51 PM 



Cimatoetal. 138(5): 1089 



http://wwwjcb.0rg/cgi/content/fuII/l 38/5/1 08 



29. Hunter, T.. 1995. Protein Icinases and phosphatases; the yin and yang of protein phosphorylation and 
signaling. Cell. 80: 225-236 [MedlineL 

30. Kaplan, D.R., and R.M. Stephens. 1994. Neurotrophin signal transduction by the Trk receptor. J, 
Neurobiol IS: 1404-1417 TMedlinel . 

31. Kredich, N.M., and D.W. Martin Jr.. 1977. Role of S-adenosylhomocysteine in adenosine mediated 
toxicity in cultured mouse T lymphoma cells. Cell 12: 931-938 [Medline] . 

32. Kujubu, D.A., J.B. Stimmel, R.E. Law, H.R. Herschman, and S. Clarke. 1993. Early responses of 
PCI 2 cell to NGF and EGF: effect of K252a and 5'-methylthioadenosine on gene expression and 
membrane protein methylation. /. Neurosci. Res. 36: 58-65 [Medline] . 

33. Laemmli, U.K.. 1970. Cleavage of structural proteins during assembly of the head of bacteriophage 
T4. Nature (Lond,), 222: 680-685 . 

34. Laskey, R.A., and A.D. Mills. 1975. Quantitative fihn detection of and ^"^C in polyacrylamide gels 
by fluorography. Eur J, Biochem. 56: 335-341 [Abstract] , 

35. Lee, J., and J. Stock. 1993. Protein phosphatase 2A catalytic subunit is methyl-esterified at its 
carboxyl terminus by a novel methyltransferase. 7. Biol Chem. 268: 19192-19195 [Abstract] . 

36. Lee, J., Y. Chen, T. Tolstykh, and J. Stock. 1996. A specific protein carboxyl methylesterase that 
demethylates phosphoprotein phosphatase 2 A in bovine brain. Proc. Natl Acad. Sci. USA, 93: 
6043-6047 [Abstract] . 

37. Lin, W.-J., J.D. Gary, M.C. Yang, S, Clarke, and H.R. Herschman. 1996. The mammalian 
immediate-early gene TIS21 protein and the leukemia-associated BTGl protein interact with a 
protein-arginine //-methyltransferase. y. Biol Chem. 271: 15034-15044 [Abstract/Full Text] . 

38. Liu, Q., and G. Dreyfuss. 1995. In vivo and in vitro arginine methylation of RNA-binding proteins. 
Mol Cell Biol 15: 2800-2808 [Abstract] . 

39. Liu, S., M.S. Wolfe, and R.T. Borchardt. 1992. Rational approaches to the design of antiviral agents 
based on S-adenosyl-L-homocysteine hydrolase as a molecular target. Antivir. Res. 19: 247-265 
[Medline] , 

40. Luo, L., T.K. Hensch, L. Ackerman, S. Barbel, L.Y. Jan, and Y.N. Jan. 1996. Differential effects of 
the Rac GTPase on Purkinje cell axons and dendritic trunks and spines. Nature (Lond.). 379: 837-840 
[Medline] , 

4L Maher, P.A.. 1993. Inhibition of the tyrosine kinase activity of fibroblast grov^h factor receptor by the 
methyltransferase inhibitor 5'-methylthioadenosine. 1 Biol Chem. 268: 4244-4249 [Abstract] . 

42. Marshall, CJ.. 1995. Specificity of receptor tyrosine kinase signaling: transient versus sustained 
extracellular signal-regulated kinase activation. Cell 80: 179-185 [Medline] . 

43. Meakin, S.O., and E.M. Shooter. 1991. Tyrosine kinase activity coupled to the high-afFinity nerve 
grov^^h factor-receptor complex. Proc, Natl Acad. ScL USA. 88: 5862-5866 [Abstract] . 

44. Metz, S.A., M.E. Rubaglia, J.B. Stock, and A. Kov/Iuru. 1993. Modulation of insulin secretion from 
normal rat islets by inhibitors of the post-translational modifications of GTP binding proteins. 
Biochem. J. 295: 31-40 [Medline] . 

45. Miller, M.W., D.M.J. Duhl, B.M. Winkes, F. Arredondo-Vega, P.J. Saxon, G.L. Wolff, C.J. Epstein, 
M.S. Hershfield, and G.S. Barsh. 1994. The mouse lethal nonagouti (a^j mutation deletes the 
S-adenosylhomocysteine hydrolase (Ahcy) gene. EMBO (Eur. Mol Biol Organ.) 7. 13: 1806-1816 
[Abstract] . 

46. Mobley, W.C., A. Schenker, and E.M. Shooter. 1976. Characterization and isolation of proteolytically 
modified nerve grov/th factor. Biochemistry. 15: 5543-5552 [Medline] . 

47. MoephuH, S.R., N.W. Klein, M.T. Baldwin, and H.M. Krider. 1997. Effects of methionine on the 
cytoplasmic distribution of actin and tubulin during neural tube closure in rat embryos. Proc. Natl. 



* " Cimato et al. I i« (5): i 089 http://www.jcb.0rg/cgi/content/fuiI/1 38/5/1 089 

Acad. Sci. USA. 94: 543-548 F Abstract/Full Text] . 

48. Najbauer, J., and D.W. Aswad. 1990. Diversity of methyl acceptor proteins in rat pheochromocytoma 
(PC 12) cells revealed after treatment with adenosine dialdehyde. J. Biol. Chem. 265: 12717-12721 
[Abstract] . 

49. Parish, C.A., A.V. Smrcka, and R.R. Rando. 1995. Functional significance of Pv-subunit carboxyl 
methylation for the activation of phospholipase C and phosphoinositide 3-kinase. Biochemistry. 34: 
7722-7727 [Medlinel . 

50. Pawson, T.. 1985. Protein modules and signaling networks. Nature (Lond.). 373: 573-580 . 

51. Peng, X., L.A. Greene, D.R. Kaplan, and R.M. Stephens. 1995. Deletion of a conserved 
juxtamembrane sequence in trk abolishes NGF-promoted neuritogenesis. Neuron. 15: 395-406 
[MedHnel . 

52. Philips, M.R., M.H. Pillinger, R. Staud, C. Volker, M.G. Rosenfeld, G. Weissmann, and J.B. Stock. 
1993. Carboxyl methylation of Ras-related proteins during signal transduction in neutrophils. Science 
(Wash. DC). 259: 868-872 . 

53. Philips, M.R., R. Staud, M. Pillinger, A. Feoktistov, C. Volker, J.B. Stock, and G. Weissmann. 1995. 
Activation-dependent carboxyl methylation of neutrophil G-protein 7 subunit. Proc. Natl. Acad Sci 
USA. 92: 2283-2287 FAbstractl . 

54. Qui, M.-S., and S.H. Green. 1992. PC 12 cell neuronal differentiation is associated with prolonged 
p2r"* activity and consequent prolonged ERK activity. Neuron. 9: 705-717 fMedlinel . 

55. Rando, R.R.. 1996. Chemical biology of protein isoprenylation/methylation. Biochim. Biophys. Acta 
1300: 5-16 [Medline]. 

56. Razin, A., and H. Cedar. 1991. DNA methylation and gene expression. Microbiol. Rev. 55: 451-458 
FMedline] . 

57. Rukenstein, A., R.E. Rydel, and L.A. Greene. 1991. Multiple agents rescue PC12 cells from 
serum-free cell death by translation, and transcription- dependent mechanisms. J. Neurosci. 11: 
2557-2563 . 

58. Seeley, P.J., A. Rukenstein, J.L. Connolly, and L.A. Greene. 1984. Differential inhibition of nerve 
growth factor and epidermal growth factor on the PC 12 pheochromocytoma line. J. Cell Biol. 98: 
417-426 [Abstract] . 

59. Shapiro, M.J., I. Chakrabarti, and D.E. Koshland. 1995. Contributions made by individual 
methylation sites of Escherichia coli aspartate receptor to chemotactic behavior. Proc. Natl. Acad. 
Sci. USA. 92: 1053-1056 FAbstractl . 

60. Soto, A.M., and C. Sonnenschein. 1985. The role of esfrogens on the proliferation of human breast 
tumor cells (MCF-7).y. Steroid Biochem. 23: 87-94 FMedlinel . 

61. Tischler, A.S., and L.A. Greene. 1978. Morphologic and cytochemical properties of a clonal line of 
rat adrenal pheochromocytoma cells which respond to nerve growth factor. Lab. Invest. 39: 77-89 
[Medline] . 

62. Turowski, P., A. Fernandez, B. Favre, N.J.C. Lamb, and B.A. Hemmings. 1995. Differential 
methylation and altered conformation of cytoplasmic and nuclear forms of protein phosphatase 2 A 
during cell cycle progression, y. Cell Biol. 129: 397-410 FAbstractl . 

63. Vojtek, A.B., and J.A. Cooper. 1995. Rho family members: activators of MAP kinase cascades. Cell 
82: 527-529 FMedlinel . 

64. Xie, H., and S. Clarke. 1994. Protein phosphatase 2A is reversibly modified by methyl esterification 
of its C-terminal leucine residue in bovine brain. J. Biol. Chem. 269: 1981-1984 F Abstract] . 



f27 



7/23/01 3:51 PM 



t-Cimatoetal. 138(5): 1089 

Copyright © 1997 by The Rockefeller University Press, 



http://www.jcb.org/cgi/content/rull/ 1 3«/i/ 1 USy 



This article has been cited by other 
articles: 

• Pawlak, M. R., Scherer, C. A., Chen, J., Roshon, M. J., 
Ruley, H. E. (2000). Arginine N-Methyltransferase 1 Is 
Required for Early Postimplantation Mouse Development, 
but Cells Deficient in the Enzyme Are Viable. MoL Cell 
Biol 20: 4859-4869 [Abstractl [Full Text] 



► Abstract of this Article 

► Reprint (PDF) Version of this Article 

► Similar articles found in: 

JCB Online 
PubMed 

► PubMed Citation 



► This Article has been cited by: 

► Search Medline for articles by: 

Cimato, T. R. II Aletta. J. M. 

► Alert me when: 

new articles cite this article 

► Download to Citation Manager 



Hbme j Helpijf Feedback i SubscriDtions Archive i Search i; Table:oftGontents 



>f27 



7/23/01 3:51 PM 



Neuron, Vol. 24, 65&-672, November, 1993. Copyright ©1999 by Cell Press 



Cypin: A Cytosolic Regulator of PSD-95 
Postsynaptic Targeting 



Boiinie L. Firestein/ Jay E. Brenman/ 

Chiye Aoki.t Ana M. Sanchez-Perez/ 

Alaa El-Din El-Husseini/ and David S. Bredt** 

• Department of Physiology 

University of California, San Francisco 

San Francisco, California 94143 

t Center for Neural Science 

New York University 

New York, New York 10003 



Summary 

Postsynaptic densKy 95 (PSD-95/SAP-90) is a mem- 
brane associated guanylate kinase (GK) PDZ protein 
that scaffolds glutamate receptors and associated sig- 
naling networks at excitatory synapses. Affinity chro- 
matography identifies cypin as a major PSD-95-binding 
protein in brain extracts. Cypin is homologous to a 
family of hydrotytic bacterial enzymes and shares 
some similarity with collapsin response mediator pro- 
tein (CRMP), a cytoplasmic mediator of semaphorin 
III signalling. Cypin is discretely expressed in neurons 
and is polarized to basal membranes in intestinal epi- 
thelial cells. Overexpression of cypin in hippocampal 
neurons specifically perturbs postsynaptk: trafficking 
of PSD-95 and SAP- 102, an effect not produced by 
overexpression of other PDZ ligands. In fact PSD-95 
can induce postsynaptic clustering of an otherwise 
diffusely localized K'^ channel, Kv1.4. By regulating 
postsynaptk: protein sorting, cypin may influence syn- 
aptic development and plasticity. 



Introduction 

Synaptic transmission requires spatial and functional 
assembly of neurotransmitter receptors and associated 
signal transduction machinery at synaptic sites. Al- 
though mechanisms for the assemb^ of postsynaptic 
protein networks remain largely unknown, recent stud- 
ies demonstrate an important structural role for postsyn- 
aptic density 95 (PSD-95 or SAP- 90) and related mem- 
brane-associated guanylate kinase (MAGUK) proteins 
(Kim et al., 1995; Komau et al., 1995; Brenman et al., 
1996a), PSD-95 was originally isolated as a major protein 
constituent of the PSD (Cho et at., 1992; KIstner et al., 
1993). Molecular cloning indicated that PSD-95 is highly 
homologous to two previously identified junctional pro- 
teins, mammalian zonula occludens 1 (ZO-1) and Dro- 
sophila discs-large (DLG), that occur at epithelial cell 
tight junctions (Woods and Bryant 1991; Witlott et al., 
1993). In addition to their roles at sites of epithelial cell 
contact MAGUKs also mediate important physiological 
functions at the synapse, as demonstrated by genetic 
analyses. Mutations of DLG, which is expressed at the 

f To whom correspondence should be addressed (e-mail: bredt® 
itsa.ucsf.edu). 



fly glutamatergic neuromuscular junction, disrupt post- 
synaptic structure and result in abnonnatty large synap- 
tic currents (Lahey et al.. 1994; Budnik et al., 1996). 
Targeted disruption of PSD-95 in mice does not grossly 
alter synaptic structure but does p»erturb hippocampal 
synaptic plasticity and, presumably through this mecha- 
nism, blocks spatial teaming (Migaud et al., 1998). 

MAGUK proteins regulate synaptic function by acting 
as molecular scaffolds to organize and cluster signaling 
machinery at the synapse. PSD-95 and the related neu- 
ronal MAGUKs, PSD-93 (or Chapsyn-110), SAP-97 (or 
hDLG), and SAP-102, all contain multiple protein-protein 
interaction motifs, including three N-terminal PDZ do- 
mains, an SH3 domain, and a C-terminal region homolo- 
gous to yeast guanylate kinase (GK) (Muller et al., 1995, 
1996; Brenman et al., 1996a; Kim et al., 1996). PDZ 
domains from PSD-95 bind to N-methyl-D-aspartate- 
(NMDA-) type glutamate receptors (Komau et al., 1995), 
Shaker (Kim et al., 1995), and inward rectifier K^ chan- 
nels (Cohen et al., 1996), and to certain peripheral mem- 
brane proteins containing a C-terminal PDZ-binding 
consensus sequence (tSXV), Thr/Ser-X-Val/lle-COOH 
(Komau et al., 1997; Sheng and Wyszynski, 1997). PDZ 
domains also bind to internal motifs in target proteins, 
as exemplified by the PDZ-PDZ domain interaction t>e- 
tween PSD-95 and neuronal nitric oxide synthase (Bren- 
man et al., 1996a; Hillier et al., 1999). By aggregating 
neurotransmitter receptors and downstream enzymes, 
PSD-95 and other multi-PDZ proteins act as molecular 
scaffolds that mediate subcellular protein compartmen- 
talization to ensure the selective activation of different 
signal transduction cascades within a single celt (Craven 
and Bredt, 1998; Scott and Zuker, 1998). 

To help understand the assembly of synaptic signaling 
complexes, recent studies have evaluated mechanisms 
for the postsynaptic clustering of PSD-95. These analy- 
ses show that postsynaptic targeting requires rwt only 
PDZ domains but also N-terminal palmitoytation of PSD- 
95, the extended SH3 region, and a short C-terminal 
targeting motif (Arnold and Clapham, 1999; Craven et 
al., 1 999). Some important implications of this work are 
that (1) assembly of synaptic complexes is an active 
process that requires multiple levels of protein sorting 
and (2) postsynaptic targeting cannot simply be ex- 
plained by PDZ domain-target interactrons. 

Here, we sought to identify proteins that regulate inter- 
mediate steps in postsynaptic protein sorting by iso- 
lating detergent-soluble PSD-95-interacting proteins. 
Using a PSD-95 protein column for affinity chromatogra- 
phy, we have isolated a highly abundant novel protein, 
cypin (cytoplasmic PSD-95 interactor), which is a PSD- 
95 binding protein in brain extracts. Cypin terminates 
with a PDZ-interacting sequence, -SSSV, that has a 
unique binding requirement for PDZ domains 1 and 2 
(PDZ1 and PDZ2). Through this interacting domain, 
cypin associates with multiple members of the PSD- 
95 family. Cypin is expressed throughout the body but 
occurs at the highest levels in neurons and intestinal 
epithelial cells, in which cypin protein is polarized to the 
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Figure 1 . Purirication of Cypin 

(Top) PSD-95-GST coupled to Affigel-10 beads was used for affinity 
chromatography of solut>iBzed brain extract Brain extract loaded 
onto a GST-Affigel-10 column and a PSD-95-GST- Affigel-10 column 
loaded with buffer alone served as controls. Silver staining shows 
a single band of 50 kDa that is specifically retained by the PSD-95 
beads. A 30 amino add peptide sequenced from this band is shown. 
(Bottom) Western blottvig shows that the channel Kv1.4 also 
specifically trinds to the PSD-9S column. 



basal membrane. Ultrastructural studies in brain indi- 
cate that cypin occurs in dendritic shafts and in the neck 
of synaptic spines but is typically excluded from the 
spine head and PSD. Consistent with this cytosolic dis- 
tribution, overexpression of cypin in hippocampal neu- 
rons disrupts the synaptic clustering of PSD-95 and 
SAP-102. This is a specific effect of cypin, as overex- 
pression of other diffusely expressed PDZ ligands does 
not affect the synaptic targeting of PSD-95. On the con- 
trary. PSD-95 can induce postsynaptic clustering of the 
cotransfected channel Kv1.4. 

Results 

Purification of Cypin, a PSD-95-Binding Protein 
in Brain 

To identify PSD-95-binding proteins in brain, we gener- 
ated a PSD-95-GST (glutathione S-transferase) fusion 
protein for affinity chromatography. We applied deter- 
gent-solubilized rat brain extracts to the PSD-95 column 
and in parallel to a control GST column. Both columns 
were washed extensively, and t>ound proteins were 
eluted with 4 M urea and were analyzed by SDS-PAGE. 
Silver staining revealed a single band of ^50 kDa that 
was retained by the PSD-95 column but not by the con- 
trol GST column (Figure 1, arrow). Tubulin, the most 
abundant protein in the brain extract, bound nonspecifi- 
cally to both columns (Figure 1A). A small amount of the 



PSD-95 fusion protein eluted from the column and also 
eluted from a conuol PSD-95 column loaded with solubi- 
lization buffer alone. 

Previous studies have identified a numt)er of PSD- 
95-bindtng proteins, and we expected these should also 
be present in the eluate fractions. Western blotting indi- 
cated that known PSD-95 -binding proteins, including 
neuronal nitric oxide synthase (nNOS), Kv1 .4, and NMDA 
receptor 28, were all purified by the PSD-95 column 
(Figure 1; data not shown). However, these proteins are 
apparently of much lower abundance in Triton-soluble 
extracts than is the 50 kDa protein, and they were not 
apparent by silver staining. 

To determine the structure of this PSD-95-binding 
protein, hereafter named cypin, we pooled the appro- 
priate column fractions and purified the 50 kDa protein 
band by preparative SDS-PAGE. Following electropho- 
resis, proteins were transfen^ed to a polyvinylidene difiu- 
oride (PVDF) nrwmbrane, and the 50 kDa band was ex- 
cised. The band was digested with cyanogen bromide, 
and the released peptides were treated with o-phthalal- 
dehyde. which blocks all peptides that lack an N-termi- 
nal proline. Microsequendng following this blocking 
procedure yielded a single unequivocal peptide se- 
querice of 30 amino acids (Rgure 1). The sequence of 
this peptide perfectly matched the conceptual transla- 
tion of an expressed sequericed tag (EST) in the public 
database. This EST was used to probe a rat brain cDNA 
lit>rary, and overlapping clones encoding a novel protein 
of 50 kDa were obtained (Figure 2A). The overall se- 
querK:e of cypin shares ^40% identity with an evolution- 
arily conserved gene family of proteins of unknown func- 
tion (Rgure 2A). Members of this family have been 
identified as open reading frames in the genomes of 
organisms from several kingdoms, including prokary- 
otes, fungi, and archea bacteria. Cypin also shares ho- 
motogy with the family of atrazir^e hydrolases, bacterial 
enzymes that dechlorinate the herbickJe atrazirie (Shao 
et al, 1995). An EST encoding the C-terminal 1 36 amino 
acids of human cypin shows 94.1% sequence identity 
with the rat cDNA (Figure 2A). In addition, Anthrobacter 
D-hydantoinase, an atrazine hydrolase, and cypin share 
homology with collapsin response mediator protein 
(CRMP) (Figure 2A). 

Northern blotting with a cypin cDNA probe identifies 
a single band of 6.4 kb in mRNA purified from rat brain 
(Figure 2B). In addition, we detect cypin message in rat 
liver and spleen as well as in human brain, placenta, 
liver, and kidney (Figure 28). Cypin mRNA is not present 
in human heart lung, skeletal muscle, or pancreas (Fig- 
ure 2B). 

We devek>ped a polyclonal cypin antiserum, which 
recognizes a 50 kDa band in crude tissue extracts (Fig- 
ure 2C). The amiserum faintly cross-reacts with nonspe- 
cific bands of 80 and 110 kDa in skeletal muscle; 60, 
68, and 95 kDa in brain; and 120 kDa in heart (Figure 
2C). The distribution of the 50 kDa cypin protein band 
is generally similar to the pattem of cypin mRNA expres- 
sion (Figures 28 and 2C). The highest levels of cypin 
protein are detected in extracts from kidney, liver, lung, 
brain, and spleen (Figure 2C). Lower levels of cypin are 
present in extracts from placenta, heart, and skeletal 
muscle, and no cypin was expressed in pancreas. It is 
curious that high levels of cypin protein are found in rat 
lung, whereas cypin mRNA is not found in human lung. 
We do not yet understand the root of this difference. 
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Figiirea. cDNA Oonng of C>pn and Characterization of Cypin Expressi^ 

QAJ, Top) ATignmem of rat cypin with a homologous protein from Saccharomyces cerevisiae, atrazine hydrolase from Rhodococcus corafl'mus 
with the conceptual translation of an EST encoding human cypin. 

QA), Bottom) Sequence alignment shows a region of significant homology t>etween CRMP, cypin, and Arthrobacter 0-hydantoinase. 
(B) mRNA for cypin is widely expressed in rat and human tissues. Northern blotting reveals that cypin mRNA (^.4 kb) occurs in rat brain, 
liver, and spleen. In mRNA purified from human tissues, cypin is expressed in brain, placenta, liver, and kidney but is not detected in heart 
lung, skeletal muscle, or pancreas. 

{Q Westem btotting shows that cypin protein is expressed at high levels in kidney, fiver, lung, brain, and spleen. Cypin is expressed at lower 
levels in skeletal muscle, placenta, and heart and is absent from pancreas. 



Cypin Interacts with the First and Second POZ 
Domains of PSD-95 via a C-termlnal PDZ 
Binding-Site 

As cypin is an abundant PSD-95 binding protein in deter- 
gent-soluble brain extracts, we sought to characterize 
its mode of interaction with PSD-95. GST fusion protein 
affinity chromatography was performed using the PDZ1, 



PDZ2, PDZ1 and PDZ2. PDZ1-PDZ3, SH3. and GK do- 
mains, as welt as full-length PSD-95. Figure 3A shows 
that cypin binds to PDZ1 and PDZ2 together but not to 
either PDZ domain alone. Binding to PDZ1 and PDZ2 is 
quamitatively simitar to cypin binding to futt-length PSD- 
95 (Figure 3B). In addition, cypin does not bind to the 
SH3 or GK domains (Figure 3B; data not shown). As 
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Figures. Cypin Interacts with PDZ1 and 
PDZ2 of PSD-95 through a C-Terminal 
-SSSV Consensus Sequence 

(A) Schematic doman map of PSD-95. 

(B) GST fusion protein affinity chromatogra- 
phy shows that cypin binding to PSD-95 re- 
qijres PDZ1 and PDZ2. Cypin does not bind 
to either PDZ domain individually or to the 
SH3 domain from PSD-95. Input represents 
10% of the bram extract used for the pull- 
down assay. Retained cypin protein was de- 
tected by immunoWoiting. Yeast two-hybrid 
analysis corrfrms that cypin binds to PDZ1 
and PD22 of PSD-95 but not PDZ 1 or 2 alone. 
Protein interaction was measured and quanti- 
fied by colortmetric assay of p-galactosidase 
enzymatic activity, and values for the various 
constructs are shown beneath the blot. 

(C) Cypin binds to PSD-95 through a C-termi- 
nal consensus sequence. GST fusion protein 
affinity chromatography shows that F>SD-95 
binds to the last nine amino adds of cypin 
(cypin and 9 AA) but not to GST. Mutation of 
the critical serine (- 2) and valine (0) to alanine 
blocks PSD-95 binding to cypin (9 AA mut). 
(p) Deletion of the last four amino adds of 
cypin attenuates association with SAP-102 in 
vivo. COST eels were transfected with con- 
structs encoding the kidicated proteins. SAP- 
102 protein was immunopredpitated. and as- 
sociation of cypin or cypin mutant (Cypin*) 
lacking the last four amino adds was deter- 
mined by immunoblotting. 
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further support for cypin binding to both the PDZ1 and 
P0Z2 of PSD-95, we cotransformed yeast with expres- 
sion vectors encoding various PDZ domains from PSD- 
95 together with full-length cypin (Figure 3B). Comple- 
menting the affinity chromatography data, cypin binds 
to PDZ1 and PDZ2, whereas cypin fails to interact with 
either PDZ domain alone. 

Conspicuous in the cypin sequence is a C-terminal 
consensus (-SSSV), which conforms to the canonical 
tSXV motif for binding to PSD-95 PDZ domains (Kim et 
al., 1995; Kornau et al., 1995). To determine whether 
this C-terminal sequence is necessary and sufficient for 
binding to PSD-95, we perfonned afTmity chromatogra- 
phy using fusion proteins of full-length cypin or the last 
nine amino acids of cypin to evaluate binding to PSD- 
95 from rat brain (Figure 3C). We found that a GST fusion 
protein of cypin selectively retains PSD-95 and that a 
GST fusion protein containing only the final nine amino 



adds of cypin is sufficient to bind an equivalent anrK>unt 
of PSD-95 (Figure 3C). In contrast a construct of the 
C-terminal nine amino acids in which the critical serine 
(-2 position) and terminal valine were mutated to alanine 
does not bind to PSD-95 (Figure 3C). We also deter- 
mined if mutation of the last four amino acids of cypin 
disrupts association with MAGUK proteins in vivo. We 
found that SAP-102 coimmunoprecipitates with wild- 
type but not mutant cypin (Figure 3D). Reciprocally, im- 
munoprecipitation of cypin, but not cypin lacking the 
final four amino acids, coimmunoprecipitated PSD-95 
and SAP-102 (data not shown). 

To test further the specificity of the PSD-95-cypin 
interaction, we performed a reverse yeast two-hytnid 
screen of 7 x 10^ colonies from a rat brain cDNA library 
using cypin as bait. The screen yielded two independerrt 
clones; remarkably, both cypin-interacting clones en- 
coded I^AGUK proteins. The first clone encoded the 
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three PDZ domains of PSD-95 (amino acids 17-427); the 
second encoded PDZ1 and PDZ2 of SAP-102 (amino 
acids 63-376). These data confirm the strong and spe- 
cific interaction between cypin and MAGUK family 
members. 

Cypin Cofractionates wrth Members of the PSD-95 
Family of MAGUKs 

As all members of the PSD-95 family can bind the 
C-terminal -SSSV consensus sequence, we sought to 
determine which MAGUKs might interact with cypin in 
brain. The sut>cellular fractionation of cypin in brain fio- 
mogenates was first compared with that of various 
MAGUK proteins. Cypin is concentrated in synapto- 
somes and is further enriched in the synaptic vesicle 
fraction (Figure 4A). As previously described, PSD-95 is 
enriched in the heavy synaptic membranes. A lower band 
corresponding to a C-terminal fragmerrt off PSD-95 (Top- 
inka and Bredt, 1998) is enriched in the synaptic vesicle 
fraction (Figure 4A). SAP-97 is found in the synaptic 
membrane fraction and is highly enriched in the cyto- 
solic fraction. PSD-93 fractionates in a manner similar 
to that of PSD-95 and is most enriched in the heavy 
synaptic memt>rane fraction. Finally, SAP-102 has a sim- 
ilar profile to that of cypin; SAP-102 is also enriched in 
the synaptic membrane fraction (Figure 4A). The effi- 
ciency of fractionation was verified with synaptophysin, 
a synaptic vesicle marker that is enriched in the synaptic 
vesicle fraction (Figure 4A). These results show that cypin 
does fractionate with synaptic plasma membranes and 
vesicles and potentially could be present both in post- 
and presynaptic fractions, together with PSD-95/-93 
and SAP-97/-102, respectively. 

Cypin Associates with MAGUK Proteins in Brain 
Association of cypin with memt>ers of the F^D family 
was assessed more directly by immunopreclpttation. 
Rat brain membranes were solubilized with 0.5% deoxy- 
cholate, and protein complexes were immunoprecipi- 
tated with two independent antisera to PSD-95. Both 
sera precipitate cypin together with PSD-95 (Figure 4B, 
Immune 1, Immune 2). Quantitation of the immunopre- 
ctprtates showed that 30% membrane-bound cypin is 
associated with PSD-95. As a control, we found that no 
precipitatton of PSD-95 or cypin occurred wfien we used 
a preimmune serum (Figure 4B, Preimmune 1). Boiling 
of brain extracts in SDS prior to immunoprecipitation 
disrupted coimmurK)precipitation of cypin with PSD-95 
(Figure 4B, Imnuine 1 + SDS). In a reciprocal set of 
experiments, we immunoprecipitated cypin from solubi- 
lized membranes from rat brain and probed for copreci- 
pitation of PSD-95 family members. We found that neu- 
ronal MAGUK proteins PSD-95, SAP-97, and SAP-102 
cocmmunoprecipitate with cypin (Figure 4C). As a control, 
we found that calcium/calmodulin kinase II (CaMKII), an 
abundant postsynaptic protein, does not coprecipitate 
with cypin (Figure 4C). 

Cypin Is Enriched in Discrete Neuronal Populations 
in the Rat Brain 

To identify neuronal pathways that might t>e regulated 
by cypin, we mapped the cellular distribution of cypin 



in brain. Cypin protein is abundant in extracts from cere- 
bral cortex, hippocampus, olfactory bulb, and corpus 
striatum but is absent from cerebellum, spinal cord, and 
medulla (Figure 5A). In situ hybridization demonsUated 
that cypin mRNA is selectively enrfched in a subset of 
neurons in regions of the brain that express high cypin 
protein levels (Figure 5B). The highest levels of cypin 
mRNA are found in rat cerebral cortex and hippocampus 
and in a stripe of cells in the superior colliculus. Lower 
levels of cypin are detected diffusely in the corpus 
striatum (Figure 58). Like cypin protein, cypin mRNA is 
absent from cerebellum, medulla, and brainstem (Fig- 
ure 5B). 

Immunohistochemical staining also reveals a neuronal 
distribution for cypin protein (Figure 5B). Diffuse staining 
of neuropil is apparent in the cerebral cortex, striatum, 
and hippocampus. The highest levels of cypin protein 
are found in a stripe of neurons in the superior colliculus 
and in intemeurons scattered throughout the hippocam- 
pus (Figure 5Ca and 5Cb). Preabsorption of the antibody 
with antigen (20 jig/mQ blocks all staining (Figures 5Cc 
and 5Cd). These cell populations in the superior collicu- 
lus and hippocampus are intensely stained in their cell 
bodies and throughout their cell processes. Cypin-posi- 
tive neurons scattered in the hippocampus extensively 
colocalize with nNOS (Bredt et al., 1991), indicating that 
they are inhibitory intemeurons (Figures 5Ce and 5C0. 

Cypin Protein Is Polarized to the Basal Membrane 
of Intestinal Epithelial Cells 

Although selectively enriched in neurons in brain, cypin 
also occurs in certain peripheral tissues. In situ hybrid- 
ization and immunohistochemical staining of numerous 
peripheral tissues were used to survey which nonneu- 
ronal cells express cypin (data not shown). This analysis 
revealed that the highest levels of cypin throughout the 
body occur in the epithelial cells of intestine. In situ 
hybridization demonstrates the discrete expression of 
cypin nf)essage in epithelial cells in small intestine (Fig- 
ures 5 Da and 5Dc). Cypin mRNA expression is highest 
in differentiated epithelial cells at the tips of the intestinal 
villi, whereas expression is lower in immature cells in 
the crypts, and cypin is absent from submucosal and 
muscular layers. Hybridization with a probe encoding 
cypin sense sequence yielded no staining (Figures 5Db 
and 5Dd). Inrmiunohistochemical staining demonstrates 
that cypin protein expresskxi in intestinal epithelial cells 
is polarized and occurs exclusively at the basal mem- 
brane (Figures 5De and 5Df). 

Cypin Localizes to Both Pre- and Postsynaptic 
Sites in Cortical Neurons 

To assess the subcellular localization of cypin in neu- 
rons, we performed immunoelectron microscopy on 
sections of rat ceret>ral cortex. Examination of peroxi- 
dase-labeled sections revealed that the highest densi- 
ties of cypin occur discretely, in a subset of cortical 
neuronal processes, with only rare labeling of astrocytic 
elements. Within neurons, immunoreactivity is found 
most commonly within the cytoplasm and at the plasma 
membrane of dendritic shafts. Though PSDs are occa- 
sionally coated with the peroxidase label, cypin immu- 
noreactivity often occurs only in the dendritic shaft and 
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Figure 4. Cypin Cofractionates and Associ- 
ates with MAGUK Proteins in Brain 

(A) Subcellular fractionation of brain was per- 
formed as described in the Experimental Pro- 
cedures. Cypin is concentrated in synapto- 
somes, synaptic plasma membranes, and the 
synaptic veside fraction, which is enriched 
in synaptophysin. Like cypin, PSD-95 is en- 
riched in synaptosomes and in synaptic 
plasma niembranes, and a lower band corre- 
sponding to a C-terminal fragment of PSD- 
95 is enriched in the synaptic vesicle h-action. 
Other fWIAGUK proteins, PSO-93. SAP-97, and 
SAP-102, are also enriched in the synapto- 
somal fractions. 

(B) Cypin specificaly coinvnunoprecipitates 
with PSD-95 from brain. Rat brain menv 
branes were solubifized with 0.5% deoxycho- 
tate arrd were Immunopredpitated with two 
irKJependent antisera against FSD-95. Both 
sera avidly precipitated cypin, together with 
PSD-95 (Immune 1 and Immune 2). Pretm- 
miMie serum (Preimmune 1) does not coim- 
munopredpitate cypin. Boilirrg the brain ex- 
tract in SDS (Immune 1 + SDS) disrupts 
coimmunoprecipitation of cypin with PSD-95. 

(C) PSD-95 and other MAGUK proteins also 
occur in cypin immunopredpltate. Brain ho- 
mogenate sotubilized with 1% Triton X-100 
was Dumunopredpitated with a cypin antise- 
rum. Immunoblotting shows that PSD-95, 
SAP-97, and SAP-102 coinvnunopredpitate 
with cypin, whereas CaMKIl does not. 
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neck of the spine and is typically excluded from the 
spine head and PSD (Figure 6A). At shaft synapses, 
cypin immunoreactivity is most concentrated upon peri- 
synaptic saccules that aggregate adjacent to the weakly 
labeled postsynaptic membrane (Figure 6B). In addition 
to labeling dendritic shafts, cypin also is concentrated 
in a subset of axonal processes. Within axons, cypin is 
concentrated at the synaptic terminals, and the reaction 
product is often seen coating synaptic vesicles (Figure 
6C). This is consistent with subcellular fractionation, 
which shows that cypin is expressed both pre- and post- 
synaptically. 

Because the peroxidase label diffuses, we also used 
sih/er-intenstfied preembeddtng immunogold. to allow 
more precise subcellular localization of cypin. Again, we 



found that cypin is most prevalent within the cytoplasm 
of dendritic shafts. Immunogold particles rarely occur 
at the PSD. Instead, the immunoreactivity is more often 
found along the plasma memt>rane adjacent to the PSD 
and upon cytoplasmic perisynaptic saccules (Figures 
6D and 6E). 

Cypin Regulates Postsynaptic Sorting 
of MAGUK Proteins 

As cypin is enriched in dendritic shafts and perisynaptic 
domains, we asked whether cypin might regulate the 
postsynaptic sorting of associated MAGUK proteins. To 
address this, we evaluated the postsynaptic trafficking 
of PSD-95 and SAP-102 in hippocampal neurons. Hippo- 
campi were dissected from late gestation embryonic 
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Rgure 5. Cypin Is Selectively Expressed in a Subpopulation of Forebrain Neirons and Occurs at the Basal Membrane of Irrtestirial Epithelial 
Cels 

(A) Western blotting shows that cypIn protein is expressed in cortex hippocampus, striatum, and olfactory bulb but not in cerebeftum. meduHa, 
or spinal cord. 

(B) In situ hybridization of a sagittal rat brain section with an antisense probe shows high expression of cypin mRNA in cerebral cortex (CTX) 
and dentate gyrus (DG) and lower expression in the corpus striatum (STR). No specific signal Is detected with a sense probe. Immunohistocheml- 
cal staining of a sagittal brain section reveals a similar pattern of protein expression in discrete neuronal populations of forebrain and no 
cypin protein expression in cerebeilum. 

(Ca) Cypin is enriched in cell soma and processes ofa layer of neurons in superior colliculus. 

(Ct>-Cd) tmmunohistochemistry reveals thai cypin is hi^ly expressed vi hippocampal neurons, and preabsorpbon of the cypin antibody v«th 
antigen (20 ^g/mO blocks staining (Cd). 

(Ce and Cf) In hippocampus, cypin is highly expressed in intemeurons, which mostiy colocaDze with nNOS (closed arrowheads). A small 
percentage off cypin-positive neurons in hippocampus are nNOS negative (open arrowhead, fCd)). 

(P) Cypin expression is restricted to the basal membrane of polarized intestinal epithePal cells. An antisense probe to cypin hybridizes to the 
epithelial ceOs (Ep) of the intestine (bright field, [Da], dark fieW, (DcR whereas the submucosa (Sub) lacks cypin mRNA. No labeling is detected 
with a sense probe (bright field, [DbJ; dark fieW, [DdJ). Immunohistochemistry shows cypin protein is polarized to the basal membrane (BM) 
and is excluded from the apical domain (Ap) of intestinal epithelial cells (200x, [Del; 400x, [DfD. 
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Figure 6. Cypin Imniunoreactivity at Synap- 
tic and Nonsynaptic Portions of Neuronal 
Processes 

Electron micrographic immunohistochemical 
staining of cerebral cortex visualized and la- 
beted with either diaminobenzidine (A-C) or 
sitver-intensffied gold (SIG) (D and E). 

(A) The open arrow points to an unlabeled 
postsynaptic density within a spine head 
(Usp) that is cormected to an immunoreactive 
spine neck (between the two small arrows) 
emanating from an immunoreactive dendritic 
shaft (Sh). 

(B) Cypin immunoreactivrty occurs diffusely 
distributed within a dendritic shaft (Sh). The 
curved arrow points to the postsynaptic 
memtxane that is adjacent to an intensely 
immunoreactive smooth endoplasmic reticu- 
lum. The small arrow points to inrwnuroreac- 
tnrity along the nonsynaptic portion of the 
plasma membrane. 

(C) Cypin immunoreactivity occurs along the 
nonsynaptic portion of the plasma membrane 
(smal arrow), surrounding vesicles, and 
along the presynaptic memt>rane of an axon 
terminal, T. The open an-ow points to the unla- 
beted postsynaptic density of a spine (Usp). 

(D) SIG labeling away (circles) and along (ar- 
rowheads) the plasma membrane of two den- 
dritic shafts. 

, . ^ ^^.^ ... A cjendritic spine (Sp) emanating from a 

den*rtic shaft exhibits immunoreactnnty along the postsynaptic membrane adjacent to the postsynaptic density (arrowhead) and immedately 
beneath the postsynaptic density (large curved arrow). As seen in dendritic shafts, immunoreactivity occurs away from and along the plasma 
membrane. Open arrow points to a postsynaptic density devoid of cypin mmunoreactivity. 
Abbreviation: T, unlabeled tenminals. Scale bar. 1 ^m (A-C); and 500 nm (D and E). 
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rats, neurons were dissociated and transfected in sus- 
pension, and cultures were maintained for 14 days in 
vitro. In mature cultures, PSD-95 and SAP-102 are tar- 
geted to dendrites and are clustered at discrete spots 
Figure 7. These clusters likely reflect postsynaptic sites, 
as they colocalize with synaptophysin (Craven et al., 
1999; data not shown). Overexpression of cypin, which 
shows a diffuse expression pattern in the dendrites of 
transfected cells, reduces the number of PSD-95 clus- 
ters (Figure 7). To quantitate this effect we counted the 
number of synaptic clusters per micrometer of dendrite 
length (Table 1). We found that transfection with cypin 
causes a 2-fold reduction in the density of PSD-95 syn- 
aptic clusters, from 1 .8 clusters per micrometer to 0.9 
clusters per micrometer (Table 1). The effect of cypin 
on SAP-102 targeting is even more striking. Overexpres- 
sion of cypin completely abolishes synaptic clustering 
of SAP-102, from 1 .8 clusters per micrometer to 0 clus- 
ters per micrometer (Table 1). Feather than being clus- 
tered at the synapse, SAP-102 becomes diffusely dis- 
tributed in the dendrites of the transfected neurons 
(Figure 7). As a control, we found that expression of a 
cypin mutant lacking the last four amino acids does not 
affect clustering of PSD-95 (1 .8 clusters per micrometer 
to 1 .9 clusters per micrometer) or SAP-102 (1 .8 clusters 
per micrometer to 1.8 clusters per micrometer) (Figure 
7; Table 1). We considered the possibility that cypin may 
influence synaptic clustering of PSD-95 and SAP-102 
by altering the total number of synaptic sites. To assess 
this, we stained transfected cultures with a presynaptic 



martcer, synaptophysin, and found that transfection with 
cypin does not alter the total density of synapses (data 
not shovm). Thus, cypin appears to specifically regulate 
the synaptic targeting of associated MAGUK proteins. 

To determine whether overexpression of any soluble 
protein containing a PDZ-btnding motif might disrupt 
MAGUK clustering, we transfected hippocampal cul- 
tures with a cDNA encoding cyan fluorescent protein 
fused to the PD2-binding nrotif EKLSSIESDV (CFP- 
tSXV). Although CFP-tSXV binds to the PDZ2 of PSD- 
95 in vitro, as expected (data not shown), it has no effect 
on the clustering of PSD-95 or SAP-102 (Rgure 8B; 
Table 1). 

Finally, we asked whether a diffusely expressed trans- 
membrane MAGUK-binding partner might disrupt the 
clustering of PSD-95 or SAP-102. For this, hippocampal 
cultures were transfected with a cDNA encoding the K+ 
channel Kv1.4. When transfected alone, Kv1.4 occurs 
diffusely throughout the neuronal soma and in both the 
axonal and dendritic processes (Figure 8A). However, 
when cotransfected with either PSD-95 or SAP-102, 
Kv1.4 clusters at postsynaptic sites (Figure 8A). In the 
cotransfections, PSD-95 itself retains its normal expres- 
sion at somatodendritic clusters and is largely excluded 
firom axons. To determine whether the postsynaptic 
clustering of Kv1.4 targeting requires the PDZ-binding 
tSXV motif, we designed a Kv1.4 construct lacking the 
C-terminal intracellular region (Kv1.4 trunc). As seen in 
Frgure 8A, PSD-95 and SAP-102 do not target this trun- 
cated Kvl .4 to postsynaptic sites. 
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Rgure 7. Cypin Regulales Postsynaptic Chisteriig <rf PSD-95 and SAP-102 In Primaiy Hppocampal Neurons 

Following 11-14 days in vhro, PSD-SS and SAP-102 occur dustered at synaptic sites (green, [A and BD- Transfection with cypin (red. [C, D, 
G, and HD reduces the number of PSD-95 clusters (green, (CD and efiminates 102 clusters (green, (G)). Transfection with a cypin mutant 
(Cypin') lacking the last four amino adds (red, (E. F, I, and JD has no effea on the dustering of PSD-95 (green, (ED or SAP-102 (green. (ID. 
Scale bar. 10 ^m. 



Discussion 

Cypin as a Soluble PSD-95-Binding Protein in Brain 
Our affinity chromatography procedure isolated cypin 
as the most abundant protein in detergent-soluble brain 
extracts that can bind to PSD-95, The C terminus of 
cypin Ser-Ser-Ser-Var binds to PSD-95 with an affinity 
similar to that of other tSXV Itgands (data not shown). 



Therefore, the unique isolation of cypin by our procedure 
suggests that cypin is more abundant in brain extracts 
than other PSD-95 ligands. What are the implications of 
such a bountiful PDZ ligand in brain? One explanation 
is that cypin occurs in numerous subcellular compart- 
ments in neurons, wtiereas most other PDZ ligands oc- 
cur only at synaptic sites (Sheng and Wyszynski, 1997; 
Kennedy, 1 998; O'Brien et al., 1 998; Craven et al., 1 999). 
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Table 1. Dendritic Clustering in Cotransfected Neurons 




PSD-95 


SAP-102 


Alone 


1.8 ± 0.1 


1.8 i 0.1 


+ Cypin 


0.9 i 0.05* 


0 ± O" 


+ Cypin* 


2.0 ± 0.2 


1.9 ± 0.1 


+ PDZ-binding CFP 


1.9 i 0.OB 


1.9 ± 0.1 



The number of dusters per micrometer of dendrite is presented. 
Cypin* Is devoid of the last C-terminal amino adds. CFP-tSXV is 
CFP fused to the PDZ-bintfing moHS EKLSSIESDV. Four measure- 
ments per cell were averaged for four eels per group. Values are 
mean z SEM values. Asterisk, p < 0.01 compared with PSD-95 
alone, and double asterisk, p < 0.001 compared v«th SAP-102 atone 
(ANOVA. with Bonferroni correction for multiple comparison). 



In addition to its enrichment at perlsynaptic dendritic 
domains, cypin also occurs in axons and is concentrated 
in a subset of nerve terminals (Figure 6). In these different 
neuronal domains, cypin has the opportunity to interact 
with distinct MAGUK proteins. Indeed, we find a robust 
association of cypin with SAP-97 (Figure 4), which Is 
concentrated in axons and neive terminals. This high 
abundance of cypin is also consistent with a structural 
role. Most previously identified PDZ domain ligands for 
PSD-95 are ion channels and enzymes (Kim et al., 1995; 
Kornau et al., 1995; Brenman et al., 1996b). These types 
of signaling molecules have primarily catalytic functions 
and are typically expressed at relatively tow levels. 
Cypin, on the other hand, appears to mediate certain 
actions by interaction with MAGUK proteins and would 
therefore need to occur at high levels in neurons. 

Cypin Is Widely Distributed and Highly Conserved 
in Phylogeny 

Proteins similar to cypin are encoded in the genomes 
of organisms from several kingdoms, including prokary- 
otes, fungi, and archea bacteria. Although these gene 
products highly related to cypin have no identified func- 
tion, cypin also shares significant homology (27.5% 
identity over 1 31 amino acids) with the family of atrazine 
chlorohydrolases (AtzA) present in Pseudomonas, Rho- 
dococcus, and certain other bacteria. In fact, cypin is 
more similar to these atrazine-catabolizing enzymes 
than is any other mammalian gene. Atrazine is a chlori- 
nated triazine herbicide used for controlling grassy 
weeds. Several microorganisms can metabolize atra- 
zine, and the first step involves dechlorination by AtzA. 

Why might this abundant PSD-95-binding protein in 
brain share homology wfith an ancient family of heterocy- 
cte-metabolizing enzymes? It may be relevant that PSD- 
95 itself contains a domain highly homologous to GK. 
The purine ring of guanosine is structurally similar to 
triazine, and cypin may cooperate with PSD-95 in gua- 
nine metabolism. It is alternatively possible that cypin 
metabolizes heterocyclic neurotransmitters derived from 
histidine, tryptophan, or adenine. It is interesting to note 
that AtzA and cypin share homology with CRMP (Go- 
shima et al., 1995) and with the Caenorhabditis elegans 
uncoordinated 33 {unc-33i gene (Li et al., 1992), which 
themselves are similar to the bacterial enzyme D-hydan- 
toinase (Figure 2). CRMP mediates intracellular re- 
sponses to semaphorin III and is required for sema- 
phorin Ill-induced growth cone retraction (Goshima et 



al., 1995). These actions of CRMP are presumably medi- 
ated by hydrolysis of neuronal hydantoins or pyrimidines 
(Wang and Strittmatter, 1 997). As cypin and CRMP both 
may hydrolyze nitrogen-based heterocycles, they may 
interact to regulate neuronal development. Finally, very 
recent studies demonsuate thai the glycine receptor- 
interacting protein gephyrin plays both a structural role 
in regulating glycine receptor clustering and a metabolic 
role in synthesizing the molybdoenzyme cofactor (Feng 
et al., 1998). Cypin may also be a dual function protein, 
with both structural and catabolic activities in neurons. 

Cypin Localizes to Specific Subcellular Domains 
in Neuronal and Nonneuronal Cells 
Outside the brain, the highest levels of cypin occur in 
intestinal epithelial cells. Within epithelial cells of both 
the large and small intestines, cypin localizes to the 
basal region of the lateral plasma membrane, and cypin 
is excluded from the apical membrane region. This baso- 
laterat membrane distribution is similar to that of SAP- 
97, one of the identified binding partners for cypin 
(Muller et al., 1995; Reuver and Gamer, 1998; Wu et al., 
1998). Efficient lateral membrane targeting of SAP-97 
requires PDZ1 and PDZ2 (Wu et al., 1998), which also 
compose the site for cypin binding. Cypin may therefore 
regulate SAP-97 localization and function in epithelial 
cells. 

In brain, cypin occurs discretely in several neuronal 
populations in foretxain, though cypin is completely ab- 
sent from hindbrain and spinal cord. In several forebrain 
regions, including the hippocampus, cypin is expressed 
at the highest levels in inhibitory interneurons. This find- 
ing fits with emerging evidence that the PSD-95-NMDA 
receptor complex is differentially regulated in excitatory 
and inhibitory neurons. Whereas PSD-95 and NMDA 
receptor subunits themselves occur ubiquitously at ex- 
citatory synapses in both glutamatergic and GABAergic 
neurons, several PSD-95-associated proteins are selec- 
tively expressed in specific cell types. In hippocampal 
neurons, synGAP Is highly enriched in excitatory neu- 
rons (Zhang et al., 1999), wh^eas several other MAGUK- 
binding proteins, including GKAP (Rao et al., 1998), cit- 
ron (Zhang et al., 1999), and nNOS (Bredt et al., 1991) 
resemble cypin and are enriched in inhibitory neurons. 
This alternative composition of the PSD can begin to 
explain the differences in signal transduction down- 
stream of NMDA receptors observed in GABAergic ver- 
sus glutamatergic neurons. For example, NMDA recep- 
tor-dependent long-term depression (LTP) does not 
occur in hippocampal interneurons (Maccaferri and 
McBain, 1996). As cypin is a highly abundant protein 
that presumably can compete with NMDA receptors and 
other synaptic ligands for interaction with PSD-95, cypin 
may regulate plasticity in hippocampal interneurons by 
altering PSD-95-associated signal transduction cas- 
cades activated by glutamate. 

Cypin as a Regulator of Postsynaptic Chistering 
of MAGUKs 

Functional expression in neurons indicates that cypin 
can regulate synaptic clustering of MAGUK proteins. 
This is particularly relevant in light of recent studies 
indicating that postsynaptic targeting of PSD-95 is an 
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Figure 8. PSD-95 and SAP-1Q2 Can Mediate Synaptic Clustering of Kv1.4 

(A) Kv1.4 is diffusely expressed thougiiout the neuron when transfected alone (red, [AaD. Coexpression with PSD-95 (greea (Ab-AdD or SAP- 
102 (green, e-g) targets Kv1.4 to postsynaptic sites (red, [Ac and AQ). Kv1.4 ladung the C-terminal tntraceflular region (Kv1.4 trunc; red, [AiD 
Is not targeted to postsynaptic sites by PSD-95 (green, (AhD. 

(B) Cyan fluorescent protein fused to the PDZ-txnding motif EKLS5IESDV (CFP-4SXV; blue, (Bb and BdD is diffusely expressed and does not 
affea postsynaptic targeting of PSD-95 (greea (Bal) or SAP- 102 (green, [BdD. Scale bar, 10 ^on. 



active cellular process that relies on N-terminal palmi- 
toylation, PDZ domains, an extended SH3 region, and 
a C-terminal targeting motif (Arnold and Clapham, 1999; 
Craven et al„ 1999). Whereas the palmitoylation and the 
C-terminal nwtif are likely Involved in cellular sorting 
pathways (A. E. E. and D. S. B., unpublished data; S. E. 
Craven and D. S. B., unpublished data), the PDZ domain 



interactions seem more likely to anchor PSD-95 at the 
PSD, which is replete with PDZ ligands. Consistent with 
this notion, overexpressibn of cypin, a nonsynaptic PDZ 
ligand in hippocampal cultures, dismpts synaptic tar- 
geting of PSD-95 and SAP- 102, and this process relies 
on the PDZ-binding site in cypin. This negative regula- 
tion of MAGUK protein clustering is a unique function 
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of cypin and is not reproduced by merely overexpressing 
a decoy PDZ-binding ligand. This suggests that specific 
structural features of cypin mediate the observed effects 
on postsynaptic clustering; the putative catalytic region 
of cypin that shares homology with atrazine hydrolase 
and CRMP is a possible candidate. Future studies on 
systematically mutagenizing cypin should help address 
this issue. 

Finally, we find that overexpression of PSD-95 or SAP- 
102 in cultured neurons recruits Kv1.4 to postsynaptic 
sites, which represents the first formal denwnstration of 
postsynaptic clustering by MAGUK proteins in neurons. 
This result must be compared with data in a very recent 
study (Arnold and Clapham, 1999), in which coexpres- 
sion of Kvl .4 and PSD-95 did not yield postsynaptic ion 
channel clustering. Instead, these investigators found 
that Kvl. 4 caused axonal targeting of PSD-95. Our ex- 
periments differ from those of Arnold and Clapham in 
several respects, including the neurons studied (hippo- 
campal versus cortical), the methods for neuronal cultur- 
ing (dispersed cultures versus slice cultures), and the 
times for protein expression after transfection (12 days 
versus 2 days). What is clear from these two studies is 
that the subtie interplay between PSD-95 and its PDZ- 
binding partners determines the cellular distribution of 
the complex. As such, cypin is as yet the first gene 
shown to negatively regulate synaptic targeting of this 
complex. 

It is important to note that hippocampal cultures con- 
sist primarily of excitatory and therefore cypin-depleted 
neurons, such that transfection of cypin into these cells 
markedly changes its cellular level and can explain the 
dramatic effect on MAGUK clustering. As cypin is an 
abundant protein expressed in specific neuronal popu- 
lations, where rt may regulate the number of synapses 
containing PSD-95 protein, the effect we find with trans- 
fection into relatively cypin-depleted hippocampal pyra- 
midal cells is likely to reflect its physk)logical role in 
other neuronal subpopulattons. Postsynaptic protein 
sorting is central to synaptogenesis. and dynamic re- 
cruitment of existing and newly synthesized proteins 
mediates aspects of LTP. By regulating postsynaptic 
targeting and interactions with PSD-95, cypin may have 
important roles in neuronal development and plasticity. 

Expenmeffital Procedures 
Antibodies 

Jhe foltowing primary antibodies were used: mouse monoclonal to 
nNOS (Transduction Latx)ratories). rabbit polyctonal to Kvl .4 (gen- 
erously provided by Dr. Uy Jan, University of Cafifomia, San Fran- 
cisco), rabbit polyclonal to PSD-95, giinea pig polyctonal to SAP-97 
(Topinfca and Bredt 1998). monoclonal to PSD-95 (Affinity Biore- 
agents). and morxxJonal to g-een fluorescent protein (Quantum), fem- 
munizing guinea pigs with a GST-SAP-102 fusion protein erK:oding 
the first 97 amino acids of SAP-102 raised the polyclonal antiserum 
to SAP-1 02. Immunizing rabbits with a fuD-lengtti GST-cypin fusion 
protein generated the polydona) antiserum to cypin. Al antisera 
were affinity purified on columns contariing the immunizing antigen 
linked to Affigel-IO resin. 

Protein Purtfication 

Affinity chromatography was performed as previously described 
(Fffestein and Bredt. 1999). Rve mg of GST. PSD-95-GST fusion 
protein, or bovine serum albumin (BSA) (Brenman et al., 1998) were 
cross-linked per mtltliter of Affigel-10 (Biorad). Adult rat brains (Z5) 



were homogenized in 10 volumes of ice-cold buffer B, Triton X-100 
was added to 1%. and proteins were extracted for 1 hr at 4^. The 
homogenate was spun at 12.000 x g for 10 mia and incubating 
with 2 ml of BSA-AfTigeHO beads precieared the supernatant The 
BSA beads were removed, and 100 ml of extract was incubated 
with 0-5 ml of PSD-95- Affigel- 10 at W for 3 hr. As controls, 10 ml 
of extraa was incubated with 50 »J of GST-Affigd-lO. and 10 ml of 
buffer B containing 1% Triton X-100 was incubated with 50 »J of 
PSD-95-Affigei-10. Extract/beads were then loaded into the appro- 
priately sized columns and were washed wiiti 100 column volumes 
of buffer B containing 0.5% NP40. 1% Triton X-100, and 1 mM 
phenylmethyisutfonyl fluoride (PMSF) foOowed by buffer B con- 
taining 1% Triton X-100. Proteins were eluted consecutively with 5 
column volumes of buffer B containing 1% Triton X-100 and 2.5 M 
irea (twice); 5 cdijnn volurnes of buffer D containing 1% Triton X-100 
and 4 M urea; and finaly, 2.5 ml of buffer B containing 1% SDS. 
Duate fractions were precipitated with 15% trichtoroacetic add. 
proteins were resolved by 8% SDS-PAGE, and bands were visual- 
ized with slver. 

The 2.5 M urea and 4 M urea fractions were combined, and pro- 
teins were separated on an 8% polyacryfamide gel and transferred 
to PVDF membrane (Immobtlon P, Mifipore). The membrane was 
stained with Coomassie blue to visualize bands, the desired protein 
bands were excised, and ProSeq (Salem, MA) performed protein 
sequencing, wNch yielded a smgle peptide sequence of 30 unam- 
t>iguous amino acids. 

GST fusion protein "pufl-down' assays (Brenman et al., 1996b) 
and synaptosomal fractionation (Rrestein and Bredt 1999) were 
done as previously described. 

cDMA Cloning and mRNA Analysis 

An EST was labeled and used to screen 10* phage plaques from a 
rat brain cDNA library (Stratagene). Five clones were isolated and 
sequenced on tx>th stands. These dones contained a single large 
open reading frame, which predicted a protein of 454 amino adds 
that contained ttie 30 amino add peptide sequenced from cypin 
protein. 

For Northern btotting, rat RNAs were isolated by the guanidine 
tsothiocyanate/Csa method, and mRNA was selected, using oligo 
dT sepharose. mMRNA was separated on formaldehyde agarose 
gels and transferred to nylon membranes. Equal amounts of mRNA 
were loaded as determined by ethidium bromide staining. The hu- 
man Northern blot was from aontech. Filters were hytmdized with 
a random primed prot>e generated with cypin cDNA as template. 
After hybricfization, membranes were washed at high stringency— 
68X, 0.1% SSC, 0.1% SDS— and exposed to X-ray film at -TO'C. 

In situ hybridization used '^S-lat)eled RNA probes and was per- 
formed exactly as described (Brenman et al., 1 996b). Tissue sections 
were exposed to X-ray flm for 4 days and then were dipped into 
photographic emulsion (Kodak NTB-2) and exposed for 14 days. 

Primary Neuronal Culture and Transfection 
Neuronal cultures were prepared from hippocampi of rat emtvyos 
at 18 days gestation. The hippocampi were dissociated with papain 
folowed by brief mechanical tritiration. Gels were plated on poly- 
D-lysine-treated glass coversips (12 mm in diameter) at a density 
of 200-250 ceBs/mm'. Cultures were plated and maintained in netro- 
basal media supplemented with B27. penicifin, streptomydn, and 
L-glutamine. Hippocampal cultures were transfected by a lipid- 
mediated gene transfer as previously descrft>ed (Craven et al., 1 999). 
Human embryonic kidney (HEK) and COST eels were transfected 
with LjpofectAMlNE Plus (GIBCO-BRL). 

tnm n rohi s t o cli e iiii sl iy 

Ligtit microscopk; irrvnunohistochemistry was done as previously 
described (Brenman et al., 1996a). Affinity-purified anti-cypin antise- 
rum was used at 1 ^g/ml, and preabsorption with antigen (20 ^.g/ 
mO abotished all staining. 

For electron microscopy, rats were perfused with 4% paraformal- 
dehyde and 3% acrolein 0.1 M phosphate buffer (pH 7.4). Sodium 
borohydrkle was used to terminate tissue fixation. Brain sections 
(40 ^^1) contairnng primary visual cortex were btocked in phosphate 
buffer containing 1% BSA. Following an overnight incubation with 
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affinity- purified anti-cypin antibody (1 iig/mfi, sections were pro- 
cessed for immunotabeling with ttie avkiirvbiotirT-HRP complex with 
the ABC Elite kit (Vector) and 3,3'-diaminobenzidine (Aldrich Chemi- 
cals) and hydrogen peroxide as substrates. Alternatively, sections 
were processed for preembedding colloidal gold immunolabeltng, 
using 1 nm gold-conjugated anthrat)bit Immunoglobulin G as the 
label. A s'Aver intensification kit (Amersham) was then used to enlarge 
the 1 nm gold particles to >5 nm for fight and electron mk:roscopic 
detection (Chan et al., 1990). These sections were postfixed with 
1% glutaraklehyde for 10 min, then with 1% osmium tetroxide for 
1 hr, and stained en bloc with 4% uranyt acetate. Sections were 
dehydrated for infiftration with the EMBED 812 resin (EM Sciences) 
for ultrathin sectioning. Sections (80 nm) were examined without 
lead citrate counterstaining. 

ImmunoprecipitaUon 

Frozen txrains were homogenized in 10 volumes of TEE (50 mM 
Tris-flCI [pH 7.4]. ISOmM NaCI. and 1 mM PMSF). Insokjble proteins 
were colected by centrifugation at 20,000 x g for 15 min, and the 
pellet was extracted in TEE with 0.5% deoxychoiate for 1 hr at 4*C. 
After predearing with proten A-Sepharose for 1 hr, lysates were 
immunoprectpitated with 10 )jig/ml polyclonal anti-cypin antibody 
or preommune arrtiserum finked to protein A-Sepharose (Pharmacia) 
for 2 hr at 4'X:. Beads vwere washed with TEE corrtaining 1% Triton 
X-100. and bound proteins were eluted in SDS loading buffer and 
analyzed by immunoblotting. For some experiments, the brain pellet 
was extracted with 0.2% SDS; this solubifized extract was heated 
to 95t^ for 5 mia and Triton X-100 was then added to 1% prior to 
immunoprecipitation. 

Transfected HEK293 or COS7 cells were tysed in 1 ml of TEE with 
1% Triton X-100 for 30 min at 4t:. Lysates were cleared of debris 
and used for coimmunopredpitation with 10 ^ml of polyckxial 
guinea pig anti-SAP-102 or monockxial ant»-PSD-95 (045, Affinity 
Bioreagents) as descritsed atxive for brain tissue. 

Yeast Two-Hybrid System 

FuD-length cypin cDNA was sutxioned into pGTB9 to generate a 
GAL4-binding domain fusion. TTiis construct was cotransfbrmed into 
yeast strain HF7c with a library of rat brain cDNAs fused to the 
GAt4 activatk>n domain (Oontech). The transformation mixture was 
plated onto a synthetic dextrose plate lackng tryptophan, leucine, 
and histkline. Growth was monitored during a S day incubation at 
30°C, and color was measured by a ^-galactosidase cotorimetric 
filter assay. Interacting clones were rescued, retransfomfied to con- 
firm interaction, and sequcrKcd. Complementary cDNAs encocf ng 
PDZ domains from PSD-95 were ctoned into pGAD-10 to generate 
GAL4 actfvatkNi domain fusion constnicts. Yeast (strain SY256) was 
cotransfbrmed with cypin, and the various PSD-95 deletion constructs 
and protein interactions were quantified by colorimetnc ^-9alactosi- 
dase assay. 
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Note Added in Proof 

While this manuscript was in review, a mouse protein Hghly similar 
to cypin was purified as a guanine deaminase (J. Biol. Chem. 274, 
8175-8189), suggesting that linked metabdtsm of guanine nucleo- 
tides by cypin and the guanylate kinase domain of PSD-95 may 
regulate synaptk: clustering. 
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Presenilins 1 and 2 are unglycosylated proteins with apparent molecular mass of 45 and 50 kDa, 
respectively, in transfected COS-1 and Chinese hamster ovary cells. They colocalize with proteins from the 
endoplasmic reticulum and the Golgi apparatus in transfected and untransfected cells. In COS-1 cells low 
amounts of intact endogeneous presenilin 1 migrating at 45 kDa are detected together with relative larger 
amounts of presenilin 1 fragments migrating between 18 and 30 kDa. The presenilins have a strong 
tendency to form aggregates (mass of 100-250 kDa) in SDS-polyacrylamide gel electrophoresis, which can 
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be partially resolved when denatured by SDS at 37 instead of 95 °C. Sulfation, glycosaminoglycan 
modification, or acylation of the presenilins was not observed, but both proteins are posttranslationally 
phosphorylated on serine residues. The mutations Ala-246 Glu or Cys-410 Tyr that cause 
Alzheimer's disease do not interfere with the biosynthesis or phosphorylation of presenilin 1. Finally, using 
low concentrations of digitonin to selectively permeabilize the cell membrane but not the endoplasmic 
reticulum membrane, it is demonstrated that the two major hydrophilic domains of presenilin 1 are oriented 
to the cytoplasm. The current investigation documents the posttranslational modifications and subcellular 
localization of the presenilins and indicates that postulated interactions with amyloid precursor protein 
metabolism should occur in the early compartments of the biosynthetic pathway. 



INTRODUCTION O 

Alzheimer's disease is a major health problem. Patients suffer fi^om a progressive dementia caused by 
massive neuronal loss in cortical and hippocampal areas of the brain (1-6). Neuropathological signs of the 
disease are tangles and amyloid deposits in the brain parenchyma, and amyloid deposits in the brain 
vasculature. The cause of the sporadic form of the disease is still unknown, although an increased risk is 
associated with the presence of apolipoprotein allele E4 (6, 7). On the other hand, familial early onset 
Alzheimer's disease is caused by point mutations in the amyloid precursor protein gene on chromosome 
21 (8), in the presenilin 2 (PS2)- gene on chromosome 1 ( 9-11 ), or, most frequently, in the presenilin 
1 (PSl) gene on chromosome 14 ( 12-15 ). Amyloid precursor protein (APP) is a type I integral membrane 
protein and is the precursor of the amyloid peptide, the main component of the senile plaques ( 1-3) . Point 
mutations in exons 16 and 17 of the APP gene cause alterations in the metabolism of APP. This results in 
an increased production of intracellular amyloid peptide containing carboxyl-terminal APP firagments 
and in an increased secretion of the potentially neurotoxic Pa4 peptide ( 1-3, 16 ). 63% of the amino acid 
residues in the sequences of the two presenilins are conserved, which strongly suggests that both proteins 
are involved in similar fimctions and have a similar pathogenic role in Alzheimer's disease. Based on 
computer algorithms, seven membrane spanning domains have been defined ( 9-15 ), although the 
possibility of nine transmembrane domains cannot be ruled out at this moment (17). The amino-terminal 
domain and the acidic loop domain, located between transmembrane domains six and seven, are 
hydrophilic and can be alternatively spliced (9, 13). The mutations that cause familial Alzheimer's disease 
are found all over the protein, but the hydrophilic loop constitutes a "hot spot" with nine different 
mutations described to date (5, LS). The biological fiinction of the presenilins remains essentially unknown, 
but, interestingly, the 103 carboxyl-terminal amino acid residues of PS2 can inhibit apoptosis in a 
"deathtrap" assay (18). Based on the homology with Caenorhabditis elegans proteins, roles in intracellular 
protein sorting and/or intercellular signal transmission have been proposed as well (19, 20). 

A hypothetical final common pathway in the pathogenesis of the genetic and sporadic forms of Alzheimer's 
disease has been postulated, based on the invariable occurrence of the amyloid deposits, the neurofibrillar 
tangles and the neurodegeneration in all affected brains. The central question is thus how mutations in the 
presenilin genes can cause this typical neuropathology. Studies indicating an increased production of more 
amyloidogenic Pa4 -(1-42) peptide in fibroblasts obtained from patients with presenilin mutations would 
support the amyloid hypothesis that postulates abnormal Pa4 amyloid peptide production and plaque 
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' formation as the pivotal event in the pathogenesis of the disease (2, 21). However, the observed increases 
in PA4-(1-42) peptide production were relatively small, and the suggested relationship between presenilin 
mutations and APP metabolism should be further corroborated by experiments showing directly the effect 
of presenilin mutations on APP processing in transfected cells or in brains of transgenic animals. Moreover, 
other aspects of APP metabolism such as the production of carboxyl-terminal APP fragments (16) should 
be investigated in greater detail. The possibility that presenilins interact with the cytoskeleton or exert their 
effect via apoptotic pathways should not be disregarded at this time (18). In addition to addressing these 
questions, basic information is needed on the subcellular localization, the posttranslational modifications, 
and the membrane orientation of the presenilins. 

We used COS-1 cells and CHO cells to express PSl and PS2, as well as Myc-tagged PSl and 

PS 1 -containing mutations that cause familial Alzheimer's disease. The biosynthesis of transfected and 

untransfected presenilins was studied using immunoblotting or metabolical labeling and 

immunoprecipitation assays. We demonstrate that transfected presenilins are phosphorylated on serine 

residues. Using immunofluorescence microscopy, we document the association of presenilin 1 with the 

early compartments of the biosynthetic pathway and demonstrate the cytoplasmic orientation of the two 
major hydrophilic domains. 



MATERIALS AND METHODS □ 

Constructs 

The cDNA coding for mouse PSl and human PSl, PS2, and PSl containing Ala-246 Glu (FADl) or 

Cys-410 Tyr (NIH2) mutations have been described (9, 12). A Myc-tagged PSl fragment was generated 
by PCR using primers 

S'-CGGGATCC ATT ATPTGAGCAAAAGCTr ATTTCTR AA K Af;r;A CJja ^ ^ ^ r^jj ^ rCTGCACCG 3' 
and 5 -GATCACATGCTTGGCGCCATAT-3' (the sequence coding for the Myc tag is underiined). This 
fragment was used to replace the NarVBamEl restriction fragment of PSl in pSG5. The resulting cDNA 
codes for PSl with the Myc tag (EQKLISEEDL) immediately after the initiator methionine, as confirmed 
by cDNA sequencing. Plasmids containing the cDNA for fiirin (22, 23) or reticulon/NSP (24, 25) were 
kindly provided by Dr. J. Creemers, Dr. A. Roebroek, and Dr. W. Van De Ven (Center for Human 
Genetics, Leuven, Belgium). 

Antibodies 

Polyclonal rabbit antisera B13, B14, and B15 were raised against peptide p45 (NDNRERQEHNDRRSLC), 
which is in the amino-terminal domain of PSl (12). Polyclonal rabbit sera B16 and B17 were raised against 
peptide p46 (EGDPEAQRRVSKNSKC) situated in the hydrophilic loop domain of PSl. Polyclonal rabbit 
antiserum 519 was raised against residues KDGQLIYTPFTEDTE(C). This antiserum reacts with a 
sequence in the second loop domain of PSl and PS2. The synthetic peptides p45 and p46 were 
manufactured by Eurogentec (Liege, Belgium), while peptide p519 was synthesized by solid-phase 
techniques and purified by reverse phase high pressure liquid chromatography in our laboratories.^ Rabbits 
were injected every 2 weeks with 100, 200, or 300 ng of peptide in complete Freund's adjuvant, coupled to 
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' keyhole limpet hemocyanin or bovine serum albumin (Pierce) and solubilized in PBS mixed with complete 
Freund*s adjuvant. Monoclonal antibody (mAb) PS 1-3, reacting with the peptide 
RRVSKNSKYNAESTERESQDTVAEN in the hydrophilic loop domain of PSl mAb 9E10 against the 
Myc tag (26), was kindly provided by Dr. J. Creemers. mAbs MON160, 161 and 162 and MON148 and 
152 against NSP/reticulon and furin have been described ( 22-25) . mAbs against the 

immunoglobulin-binding protein (BiP) and against the calcium pump Serca 2a (1108) were purchased from, 
respectively, StressGen (Victoria, Canada) and Affinity BioReagents (Neshanic Station, NJ). 

Cell Culture, Metabolic Labeling, Cell Extraction, and Immunoprecipitation 

The fibroblast cell line (designation AG07657, Coriell Institute) from an unaffected individual of the FADl 
lineage was cultured in Dulbecco's modified Eagle's medium supplemented with 20% fetal bovine serum 
(Life Technologies, Inc.). COSl cells were cultured as described previously in Dulbecco's modified Eagle's 
medium/Ham's F-12 with 10% fetal bovine serum (27) . CHO cells stably transfected with APP770 were 
kindly provided by Dr. B. Greenberg (Cephalon, West Chester, PA) and cultured in high glucose 
Dulbecco's modified Eagle's medium (Life Technologies, Inc.) supplemented with 0.1 mM minimal 
essential medium (nonessential amino acids), 3% fetal calf serum, and 250 nM methotrexate. 

COS cells were transfected using DEAE/dextran (27), while CHO cells were transfected using 
LipofectAMINE according to the instructions of the manufacturer (Life Technologies, Inc.). 

Metabolic labeling was done with 100 ^Ci of [^^S]methionine, 250 nCi of [^^SJsulfate, 60 ^iCi of 
[^H]glucosamine, 500 jiCi of [^HJpalmitic acid or [^H]myristic acid, or 500 |iCi of 
[^^P]orthophosphate/ml of the appropriate culture mediimi. All radiolabeled precursors were from 
Amersham. Cellular labehng was done for 4 h unless otherwise specified, and cell extracts were made as 
described previously (27). For analysis of phosphorylation, a postnuclear extract was prepared using 0.5% 
(v/v) Triton X-100 in Tris-buffered saline (TBS) containing proteinase inhibitors (100 imits/ml aprotinin, 
1 jxg/ml pepstatin) and tyrosine and serine/threonine phosphatase inhibitors (1 mM sodium orthovanadate, 
5 mM EDTA, 5 mM EGTA, 20 mM NaF). The nuclei were pelleted by centrifugation at 14,000 rpm 
(15 min) in a cooled Eppendorf centrifiige. Antisera were added to the cell extracts at a 1/250 dilution, and 
antibody-antigen complexes were collected by incubation with immobilized protein G (Pierce) overnight at 
4 °C. The immunoprecipitates were washed using Tris-buffered saline containing 1% (v/v) Triton X-100, 
1% (w/v) deoxycholate, and 0.1% (w/v) SDS (27). For analysis of phosphorylation, phosphatase inhibitors 
as detailed above, were added to all buffers. 

Phorbol myristic acid, phorbol dibutyric acid, forskolin, okadaic acid and staurosporin (all from Sigma) 
were added to the cell cultures during the last 30 min of the metabolic labeling at the indicated final 
concentrations. For in vitro labeling assays, inraiunoprecipitated PSl or PS2 bound on protein G beads 
were incubated with purified protein kinase A or C in Tris-buffered saline, in the presence of 0.1 mM 
[Y-'^^P]ATP, 2 mM Mg^*^ during 60 min at 30 °C. The precipitates were washed and analyzed in 
SDS-PAGE. Gels were quantitatively analyzed using a Phosphorlmager (Molecular Dynamics). 

For immunoblotting experiments, cells were scraped in PBS, centrifiiged (10 min, 1000 rpm), and 
solubilized in Laemmli sample buffer. Blots were stained with mAb 9E10 (Myc tag) ascites fluid (1/1000 
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dilution) or mAb PSl-3 hybridoma supernatant (1/100 dilution) and affinity-purified goat anti-mouse 
peroxidase-conjugated antibodies (1/10,000; Bio-Rad), using the sensitive ECL system (Amersham). 

Phosphoamino Acid Analysis 

Immunoprecipitated -^^P-labeled PS were size-fi-actionated in SDS-PAGE, and labeled bands were 
localized by autoradiography. The proteins were hydrolyzed in 6 N HCl (110 °C, 90 min). The hydrolysate 
was supplemented with phosphoamino acid standards and analyzed by two-dimensional thin-layer 
electrophoresis at pH 1.9 and 3.5 in the respective dimensions ( 28) . Phosphoserine, phosphothreonine, and 
phosphotyrosine were localized using ninhydrin staining, and the radioactive phosphoamino acid residues 
were visualized by autoradiography. 

Immunocytochemistry 

Fibroblasts were grown on coverslips coated with mouse collagen IV (Collaborative Biomedical Products; 
1 ^ig/cm^). Transfected COS-1 or CHO cells were cultured in Lab-TEK chamber slides (Nunc). Cells were 
washed twice in PBS, fixed in 4% formaldehyde in PBS for 10 min at room temperature, and washed three 
times in PBS and once in TBS. Cells were permeabilized with 0.02% (v/v) Triton X-100 in TBS for 20 min 
or with 0.2% (w/v) saponin for 10 min and washed with 0.1% Tween 20 in TBS. Nonspecific binding was 
blocked with 0.2% cold water fish gelatin, 2% bovine serum albumin, and 2% fetal calf serum (blocking 
buffer). Cells were probed with affinity-purified primary antibody 519(1 :25) or with immune serum 
(1:400). Appropriate FITC- and TRITC-conjugated anti-mouse, anti-rat, and anti-rabbit antibodies (Sigma) 
were used at 1/400 dilution. Preparations were viewed on a Nikon Diaphot 300 or a Zeiss Axiophot UV 
microscope. Digifized immunofluorescence images were obtained using an LSM419-inverted 
laser-scanning confocal microscope (Zeiss Inc.) and processed using NIH Image software. 

Selective permeabilization of the plasma membrane was obtained by incubating fixed cells in 10 mM Pipes 
buffer (pH 6.8) containing 0.3 M sucrose, 0.1 M KCl, 2.5 mM MgCl2, 1 mM EDTA, 5 ^g/ml digitonin 

during 15 min at 4 °C (29). Cells were washed with PBS and fiirther processed. A rat monoclonal antibody 

against the KDEL sequence (30, 31) was used to demonstrate permeabilization of the endoplasmic 
reticulum membrane. 



RESULTS n 

Immunocytochemical Localization ofPresenilin 1 in the Endoplasmic Reticulum and the Golgi Apparatus 

The subcellular localization of the presenilins was investigated in permeabilized and fixed fibroblasts using 
affinity-purified polyclonal antibody 519 raised against the peptide KDGQLIYTPFTEDTE, which is a 
conserved sequence in the second loop domain of PSl and PS2 (9, 12). A fine reticular staining in the 
cytoplasm and a more pronounced perinuclear staining was observed in the cells (Fig. XA)^ which partially 
colocalized with the staining obtained with a mAb against BiP, an endoplasmic reticulum marker. 
Preabsorpfion of antibody 519 with the peptide antigen resulted in loss of staining, demonstrating the 
specificity of the observed signals (Fig. iQ. 
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Fig. 1. Immunofluorescent staining of PSl/2 in untransfected 
flbroblasts. Fibroblasts were fixed with formaldehyde and 
permeabilized with Triton X-100. Cells were incubated with 
affinity-purified rabbit antibody 519 against the loop 2 domain of PS 1/2 
{panel A) and mouse mAb against BiP {panel B\ followed by appropriate TRITC- and FITC-conjugated 
secondary antibodies. Presenilin staining is observed in the cytoplasm as a reticular pattern {A) that 
partially colocalizes with the staining obtained with the mAb against the endoplasmic reticulum marker BiP 

(5). Panel C, fibroblasts stained with antibody 519 preabsorbed with peptide revealed only weak 
background fluorescence. 

[View Larger Version of this Ima^e (32K GIF file)] 



The same, lacelike network was seen in COS-1 cells transfected with plasmids containing the cDNA for 
PSl and using antiserum B14 against peptide NDNRERQEHNDRRS in the amino-terminal domain of PSl 
or antiserum B16 against peptide EGDPEAQRRVSKNSKY in the hydrophilic loop domain of PSl (Fig. 2, 
a and c). Untransfected COS-1 cells remained negative under the experimental conditions used, probably 
because of the very low levels of endogeneous PS present in these cells (see below). The same pattern of 
staining was observed with monoclonal anfibody EDS (32) against Serca 2a Ca^'^-ATPase (results not 
shown), and with antibodies against transfected reticulon/NSP (Fig. 2b), Both proteins are located in the 
endoplasmic reficulum and the Golgi apparatus (24). The distribution of PSl and reficulon/NSP remained 
identical in double transfected cells (compare panels a and b in Fig. 2). Transfected APP (Fig. 2e) and 
transfected fiirin (Fig. Id), in contrast, were mainly found in the Golgi apparatus, as shown previously (33, 
34) . Similar resuUs were obtained in CHO cells (results not shown). 

■ Fig. 2. Immunofluorescent staining of PSl in transfected COS ceils. 
COS-1 cells were double transfected with plasmids coding for PSl and 
reticulon/NSP {a and b) or single transfected with plasmids coding for 
PSl, amyloid precursor protein, or fiirin (c, rf, and e). Cells were fixed with 
formaldehyde and permeabilized with saponin. Immunostaining was done 
with anfiserum B14 against the amino-terminal domain of PSl (a) or B16 
against the loop domain of PSl (c), with a mix of monoclonal antibodies 
MON160-162 against reticulon/NSP {b\ or MON148 and 152 against 
fiirin {d), or mAb 22C1 1 against amyloid precursor protein (e), followed, 
by the appropriate TRITC-conjugated {red) or FITC-conjugated {green) 
secondary antibodies. Notice the fine reticular pattern obtained with PSl antibodies in panels a and c, 
which distributes with reficulon/NSP {panel b) in double transfected cells. Panel e demonstrates that 
amyloid precursor protein is located mainly in the Golgi apparatus, similar to furin {panel d), 
rView Larger Version of this Image n28K GIF file)] 



Biosynthesis of the Presenilins 
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Metabolic labeling of untransfected or "mock" transfected COS-1 or CHO cells using [^^S]methionine, 
followed by immunoprecipitation of the cell extracts using antibodies 519, B14, or B17 and resolution of 
the immunoprecipitates in SDS-PAGE, yielded no signals (Fig. 3A, lane 4) or, after prolonged exposures 
(2 weeks and more), only nonspecific signals in autoradiography (results not shown). Immunoprecipitation 
of detergent extracts of COS-1 cells transfected with plasmids containing the cDNA of wild type PSl, in 
contrast, yielded strong specific signals of radiolabeled protein migrating with an apparent molecular mass 
of 45 kDa (Fig. 3A, lanes 1-3). Diffuse protein bands with masses between 100 and 250 kDa were observed 
to a variable extent (Fig. 3A). Similar results were obtained with PS2 cDNA. The main PS2 species 
migrated, however, slightly more slowly than PSl, resulting in an apparent mass of 50 kDa (see below). 
Unrelated polyclonal antibodies, or untransfected cells did not yield these bands, while immunoblots of 
COS-1 cells transfected with Myc-tagged PSl and stained with the Myc tag-specific mAb 9E10 revealed 
again the pronounced smearing (Fig. 3B). This result clearly demonstrated that the 100-250-kDa protein 
smears consisted of PS protein, either associated or not associated with other proteins. Essential similar 
patterns were observed when PSl FADl (Ala-246 Glu) or PSl NIH2 (Cys-410 -> Tyr) were expressed 
(Fig. 3A). These clinical mutations (12) therefore do not cause major alterations in the biosynthesis of PSl 
protein when overexpressed in COS-1 cells (Fig. 3A). Independent experiments performed in CHO cells 
confirmed completely these results (results not shown). 

Fig. 3. Biosynthesis of PSl protein in COS-1 cells. Panel A, COS-1 
cells were transfected with constructs containing the cDNA coding for 
wild type PSl (PS J WT), PSl FADl (Ala-246 -* Glu), PSl NIH2 
(Cys-410 -* Tyr), or with expression vector alone {pSG5 control). Cells 
were metabolically labeled with ["'^SJmethionine for 4 h. Cells were solubilized, and PSl was 
immunoprecipitated with antiserum B17 (1/250). Immunoprecipitates were resolved by 4-20% gradient 
SDS-PAGE. Molecular size markers are indicated at the left in kDa. Panel B, Westem blotting of 
COS-1 cells transfected with PSl containing amino-terminally inserted Myc epitope. Cell extracts were 
electophoresed on a 4-20% gradient acrylamide gel and transferred to a nitrocellulose membrane. mAb 
9E10 against the Myc epitope was used to detect PSl. Panel C, combined immunoprecipitation and 
Westem blotting of PSl in untransfected COS cells. Detergent extracts of 10 x 10^ untransfected COS-1 
cells were made, and PSl was immunoprecipitated using antiserum B13 (N-term) or B17 (Loop). In lanes 
1,3,5, and 7, 0.5 x 10^ COS-1 cells transfected with wild type PSl were added to the untransfected cells. 
Immunoprecipitated material was resolved in 4-20% gradient SDS-PAGE and transferred to a 
nitrocellulose filter. Filters were reacted with mAb PS 1-3 and goat anti-mouse peroxidase-conjugated 
antibodies (lanes 1-4, indicated by PSl-3) or with goat anti-mouse peroxidase conjugated antibodies alone 
(lanes 5-8, -Co). The mobility of intact PSl is indicated by an arrow at the right. 
rView Larger Version of this Image (24K GIF file^] 



Since the levels of endogeneous PSl were not detectable using classical immunoprecipitation, a sensitive 
immunological "sandwich" type assay was developed. PSl was immunoprecipitated from detergent extracts 
of 10 X 10^ untransfected COS-1 cells using polyclonal PSl antisera. A small amount of transfected cells 
(0.5 X 10^) was added to the positive control samples. The immunoprecipitates were then resolved in 
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SDS-PAGE electrophoresis and transferred to a nitrocellulose filter. Immunoprecipitated PSl was finally 
detected with mAb PS 1-3 raised against peptide RRVSKNSKYNAESTERESQDTVAE in the loop 
domain of PSl. Weak signals representing intact endogeneousPSl were revealed in untransfected COS-1 
cells (Fig. 3C, lanes 2 and 4). Endogeneous PSl migrated with the same mobility (45 kDa) as transfected 
PSl, as is best seen in the experiments using the amino-terminal domain-specific antiserum B13 (Fig. 3C, 
lane 2). Since mAb PS 1-3 recognizes an epitope in the carboxyl-terminally located hydrophilic loop, the 
combination of these antibodies is expected to detect mainly intact PSl (Fig. 3C, lane 2). With hydrophilic 
loop-specific antiserum B17, in contrast, relatively pronounced 18-30-kDa fi-agments were visualized 
together with intact PSl . These fi-agments represent most likely carboxyl-terminal fi-agments of presenilin 
(35)- Remarkably, the observed fi-agments were not, or only marginally, increased in COS-1 cells 
overexpressing PSl (compare lanes 3 and lane 4 in Fig. 3C). Pulse-chase experiments on transfected 
CHO-cells (Fig. 4, A and Q and COS-1 cells (results not shown) fiirther showed that the transfected 
45-kDa PSl species has a half-life of about 4 h. No fi-agments were seen at any time point of this assay, 
either with amino-terminal (B13) or hydrophilic loop-specific (B17) antisera. APP, immunoprecipitated 
fi-om the same cell extracts (Fig. 45), displayed a much faster turnover (half-life: 2 h), indicating that the 
relative high expression of PSl did not interfere with the normal turnover of APP. 

Fig. 4. Pulse-chase metabolic labeling of PSl in transfected CHO 
cells. CHO cells expressing stably human APP770 were transfected with 
PSl and metabolically labeled with [^^S]methionine during 10 min in 
methionine-fi-ee medium. Incorporated label was chased by incubation 
of the cells in complete medium for the indicated time. PSl was immunoprecipitated fi-om the cell extracts 
using B13 (N-term) or B17 {Loop) antiserum and resolved by 4-20% gradient SDS-PAGE {panel A). APP 
was consecutively immunoprecipitated firom the same cell extracts using antibody 207 (45, 47) and 
analyzed in 6% homogeneous SDS-PAGE. Notice the difference in turnover of transfected presenilin 1 and 
transfected APP. Panel C displays a quantitative analysis (mean ± S.E.) of three independent pulse-chase 
experiments of PSl. Signals were quantitated using a Phosphorlmager. The points represent the fi-action of 
PSl synthesized during a 10-min pulse and remaining after the period of chase as indicated. The results 
obtained for APP in two experiments are also displayed. 
rView Larger Version of this Imaee (14K GIF file)] 



Posttranslational Modification and Aggregation of Presenilins 

The problem of the presenilin smears in SDS-PAGE was fiirther investigated. Enzymatic digestion of 

immunoprecipitated PSl and PS2 protein with glycosidase F (3 milliunits/nl), 0-glycanase 

(60 nanounits/^l), endoglycosidase H (60 nanounits/^l), sialidase (100 nanounits/^l), or combinations of 

these enzymes did not affect the mobility of the proteins, indicating that no glycosylation of PSl and PS2 

occurred (results not shown). Digestions with heparinase (0.1 milliunits/nl), heparitinase 

(0.1 milliunits/nl), chondroitinase AC (5 milliunits/^1), and chondroitinase ABC (5 milliunits/|al) also had 

no effect on the mobility of the protein smears, indicating that PSl is not modified by glycosaminoglycan 

chains (Fig. 5). These negative results were independently confirmed by metabolic labeling experiments 
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using [^H]glucosamine or [ SJO^. While both radioactive precursors were incorporated in proteins in the 

cell extracts of the labeled cells, no signal was obtained when transfected PSl was immunoprecipitated 
(results not shown). Experiments using [^H]palmitic or [^H]myristic acid demonstrated also that the 
presenilins did not incorporate fatty acids. 

Fig. 5. Presenilins are not modified by glycosaminoglycan chains. PSl was 

immunoprecipitated using serum B16 from metabolically labeled COS-1 cells 
transfected with wild type PSl and digested with heparinase, heparitinase, 
chondroitinase ABC, or chondroitinase AC. Digested material was resolved in 
12% homogeneous SDS-PAGE. The activity of the heparinase and heparitinase 
was confirmed using proteoglycans immunopurified from fibroblasts (50). The 
activity of chondroitinase ABC and AC was demonstrated using a test solution 
of chondroitinesulfate which was clarified after addition of the enzymes. 
Molecular size markers are indicated at the left in kDa. 
rView Larger Version of this Image (39K GIF file)] 



Sonication or boiling of the sjimples in SDS and 6 M urea (data not shown) or extraction of the cells in the 
presence of 5 mM dithiothreitol did not resolve the aggregates (Fig. 6). Denaturation of the 
immunoprecipitates at 37 °C instead of at 95 °C reduced the aggregates considerably but not completely 
(Fig. 6, lane 2). This demonstrated that the aggregates consisted mainly, if not exclusively, of PSl as the 
radiolabeled species (Fig. 6) and also suggested that the PS aggregates are at least partially produced during 
the processing of the samples for electrophoresis. It should be noticed, however, that even when freshly 
prepared material was used and heating of the samples was avoided, protein smears in the 100-250-kDa 
region remained visible in SDS-PAGE (Fig. 6). 

Fig. 6. The presenilin aggregates are temperature-sensitive. 

Immunoprecipitated PSl (antiserum B16) from transfected COS cells was 
denatured for 10 min with SDS and P-mercapthoethanol at 95 °C {lanes 1 and 3) 
or at 37 °C {lane 2), In lane 3 cell extracts were prepared in the presence of 5 mM 
dithiothreitol {DTT), Notice the decrease of the 100-250-kDa aggregates and the 
slightly increased intensity of the 45-kDa band in lane 2 as compared to lanes 1 
and 5. Samples were resolved on a 4-20% SDS-PAGE. 



rView Larger Version of this Image (28K Gff filell 



Phosphorylation of the Presenilins 

The high level of serines, threonines, and tyrosines in the primary amino acid sequences of both PSl and 
PS2 suggested the possibility that the PS proteins are phosphoproteins (12, 13). Transfected COS-1 cells 



///// 



250 — 
98 — 



36 — 

ao — 




f20 



7/23/01 3:48 PM 



: - De Strooper et al 272 (6): 3590 



http://www.jbc.Org/cgi/content/fuiI/272/6/359C 



(Fig. 7) and CHO cells (Fig. 85) incorporated -^"^P in presenilins. Acid hydrolysis of immunoprecipitated 

•^^P-phosphorylated PSl and PS2 yielded mainly phosphoserine (Fig. 75). Longer exposures revealed very 

weak signals for phosphothreonine, while phosphotyrosine was never observed. The phosphorylation of 

PSl was strongly increased, and that of PS2 slightly or not increased, by treating the cells with the 

phosphatase inhibitor okadaic acid (300 nM, Fig. 8^). Okadaic acid inhibits the two major classes of 

serine/threonine phosphatases (49). The protein kinase C inhibitor staurosporin (300 nM) or the agonists 

phorbol myristic acid (1 ^M) and phorbol dibutyric acid (1 )iM) and the protein kinase A agonist forskolin 

(100 |iM) had no effect on the phosphorylation extent or pattern (Fig. 8^). Attempts to phosphorylate 

immunoprecipitated PSl in vitro using protein kinase A or protein kinase C remained also negative (results 
not shown). 

Fig. 7. Phosphorylation of PSl and PS2 on serine residues. Panel a, 
transfected COS-1 cells were metabolically labeled using 
[ P]orthophosphate and post-nuclear cell extracts were prepared (see 
"Materials and Methods"). PSl and PS2 were immunoprecipitated with 
antiserum Bl 6 (PSl) or 519 (PS2). COS-1 cells transfected with pSG5 
were labeled, extracted, and immunoprecipitated with the same 
antibodies (-). Panel b, thin layer chromatography of hydrolyzed PSl 
(45 kDa) and PS2 (50 kDa) identifying serine as the phosphorylated 
residues. The mobility of phosphoserine (5), phosphothreonine (7), or 
phosphotyrosine {Y),as detected by ninhydrin staining, is indicated by the 
dotted lines. 

[View Larger Version of this Image (3 IK GIF file)] 
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Fig. 8. Analysis of PSl and PS2 phosphorylation. Panel A, 
staurosporin (300 nM) or okadaic acid (300 nM) were added to the 
medium of transfected COS cells after a 3.5-h incubation with 
[^"^PJorthophosphate. 30 min later, cell extracts were made as in Fig. 4. 
PSl or PS2 were immunoprecipitated using, respectively, antiserum 
B17 or 519 and resolved by a 4-20% gradient gel. Panel B, transfected CHO cells were labeled with 
[^^PJorthophosphate and PS was immunoprecipitated using antiserum 519 {PS2 WT) or B16 (all other 
lanes) and resolved by a 4-20% gradient gel. 
["View Larger Version of this Image (30K GIF file)] 



Cytoplasmic Orientation of the Amino-terminal Domain and Hydrophilic Loop Domain of Presenilin 1 

We finally addressed the issue whether the two major hydrophilic domains in presenilin 1, i.e, the 
amino-terminal domain and the hydrophilic loop domain (12) are oriented toward the cytoplasmic or the 
luminal side of the endoplasmic reticulum. Current models for the orientation of the presenilins in 
membranes are based on theoretical predictions (see "Discussion"). Both domains are candidate regions for 
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interactions with other proteins, and their orientation determines whether this interaction occurs with 
cytoplasmic or endoplasmic reticulum proteins. To analyze this question, we used low concentrations of 
digitonin to selectively permeabilize the plasma membrane (see "Materials and Methods") and antibodies 
against the amino-terminal domain (antiserum B 14), against the Myc tag introduced at the amino terminus 
(mAb 9E10), against the second loop domain (antibody 519), and, finally, against the hydrophilic loop 
domain (antiserum B 16) of PS 1. To monitor the permeabilization procedure, rat monoclonal antibody 
against the luminal endoplasmic reticulum retention signal KDEL was used QO, 31). Antibodies against the 
amino-terminal domain of PSl, against the amino-terminally inserted Myc tag (Fig. 9a) or against the 
hydrophilic loop (Fig. 9e) reacted with PSl in digitonin-permeabilized cells under conditions in which the 
KDEL-specific monoclonal antibody did not result in labeling (Fig. 9, a and e). On the other hand, 
antibodies against the second hydrophilic loop domain (519) reacted with PSl when saponin (Fig. 9d), but 
not when digitonin (Fig. 9c) was used to permeabilize the cells, suggesting a luminal localization of this 
domain. 

Fig. 9. Cytoplasmic orientation of the amino-terminal domain and the 
hydrophilic loop of PSl. COS-1 cells transfected with PSlMyc (panels a and b) 
or with PSl WT (panels c-J) were permeabilized using digitonin (panels a, c, and 
e) or saponin (panels b, d, and J) and immunostained with anti-Myc mAb 9E10 
(panels a and 6), with polyclonal antibody 519 against the second loop domain 
(panels c and d), with polyclonal antibody Bl 6 (panels e and y) and with rat 
monoclonal antibody against the KDEL sequence (panels a-f). PSl 
immunoreactivity is observed as red/yellow staining, while KDEL 
immunoreactivity is green. Panels a and b show that the amino terminus of PSl is 
oriented to the cytoplasmic side of the endoplasmic reticulum membrane. Cells 
that express PSlMyc stain strongly with the amino-terminal domain directed 9E10 mAb both in digitonin 
(a) or saponin (b) permeabilized cells. When saponin is used, the untransfected cells are stained with the 
KDEL antibody (green) since it reacts with endogenous proteins (b). When digitonin is used, no staining is 
observed (a), indicating that the endoplasmic reticulum membrane is not permeable for antibodies under 
the conditions used. In panels c and d, similar experiments are displayed but using antiserum 519 against 
the second loop domain. Staining is only observed when saponin is used, indicating a luminal orientation of 
this domain (panel d). Panels e and/ demonstrate that antiserum B16 against the hydrophilic loop domain 
of PSl react both with digitonin (e) or saponin (/) permeabihzed cells, indicating a cytoplasmic orientation 
of the hydrophilic loop. 

[View Larger Version of this hnage (9 IK GIF file)] 



DISCUSSION n 

The current investigation provides a detailed characterization of the biosynthesis of presenilins 
overexpressed in COS-1 cells imd CHO cells. The results show that the PS proteins are unglycosylated 
phosphoproteins with apparent molecular masses of about 45 and 50 kDa in SDS-PAGE. They have a 
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Strong tendency to form SDS-resistant complexes with apparent molecular masses between 100 and 
250 kDa. Double immunofluorescence studies showed that PSl is mainly located in the endoplasmic 
reticulum and the Golgi apparatus and that its amino-terminal domain and the hydrophilic loop domain are 
oriented to the cytoplasmic side. 

The localization of transfected PSl in the endoplasmic reticulum of COS- 1 cells or CHO cells is not a 
simple consequence of overexpression of the protein or nonspecific general effects on the biosynthetic 
pathway in these cells. First, endogeneous presenilins are located in the endoplasmic reticulum of 
untransfected cells as demonstrated in Fig. 14 using confocal laser microscopy. Second, using identical 
transfection conditions, APP and furin were found to accumulate in the Golgi apparatus, as demonstrated 
before (33, 34). Third, the turnover (Fig. 4) and the secretory processing of APP^ in CHO cells stably 
expressing human APP770 was not affected by transfection of PSl, ruling out general inhibitory effects on 
protein transport and processing. On the other hand, the levels of endogeneous presenilin in COS-1 cells 
are apparently very low. Only the combination of immunoprecipitation to concentrate PSl from detergent 
extracts of 10 x 10^ cells, followed by immunoblotting using mAb PS 1-3, allowed us to detect 
endogeneous PSl migrating at the same molecular weight as transfected PSl. The disadvantage of this 
approach is that the antibodies from the immunoprecipitation step interfere with the consecutive detection 
of possible PS-aggregates in the immunoblotting step (Fig. 3Q. This assay also precludes a dynamic 
analysis of the metabolism of the presenilins using metabolic labeling and pulse-chase experiments. On the 
other hand, it allowed us to demonstrate the presence or absence of particular antibody epitopes on PSl 
fragments. Importantly, the use of hydrophilic loop-specific antibodies in the immunoprecipitation step 
resulted in the detection of relative large amounts of, presumably proteolytic, fragments of PSl in the 
18-30-kDa range (Fig. 3Q. The level of these fragments was not, or only slightly, increased in COS-1 cells 
that overexpressed PSl, which explains why they were not readily detected in the transfection experiments. 
The proteolytic process involved is probably easily saturated or even inhibited by PS overexpression (35, 
36). Our data therefore do not allow us to speculate any ftirther on the exact nature or the biological 
significance of this process. We can only conclude that overexpression studies of the type used here are not 
suitable to study this particular aspect of presenilin metabolism. On the other hand, it is clear that 
untransfected cells (Fig. 3) and brain tissue in vivo (36,37) contain detectable amounts of intact presenilin. 
Since this endogeneously expressed PSl has the same mobility as transfected PSl, the conclusion that 
transfected PSl (and PS2) are not subject to glycosylation, sulfation, glycosaminoglycan modification, 
palmitoylation, or myristoylation also holds apparently true for the endogeneous protein. These 
posttranslational modifications are thereby also excluded as contributing to the formation of the high 
molecular mass, SDS-resistant aggregates (100-250 kDa) in SDS-PAGE electrophoresis. These aggregates 
were observed to a variable extent in all our experiments, and were noticed by others using other cell types 
or brain tissue (36, 38, 39). Since the smears were detected with three different antibodies recognizing three 
different epitopes, and by a Myc-directed mAb using a Myc-tagged PS 1 construct, the aggregates must 
contain presenilin core proteins, alone or associated with unidentified components. Denaturation of 
immunoprecipitates at 37 °C instead of 95 °C, resulted in less aggregates and increased amounts of the 
45-kDa PSl band (Fig. 3Q. The aggregates therefore probably consist mainly of oligomers of presenilins. 
It is unclear whether this property to form SDS-resistant aggregates in vitro has any physiological 
significance in vivo, but immuno-electron microscope observations suggest that clustering of presenilins 
also occurs in the endoplasmic reticulum of untransfected cells.^ It should be envisaged that in pathological 
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conditions exacerbated aggregation of presenilins could become a problem. In this context, it should be 
mentioned that antibodies against PSl stain amyloid plaques in the brains of Alzheimer's disease patients 
(40). 

Importantly, the two point mutations causing Alzheimer's disease that were studied here, i.e. PSl Ala-246 
Glu (FADl kindred) and PSl Cys-410 Tyr (NIH2 kindred), did not alter the biosynthesis or level of 
expression of the PSl protein in COS-1 cells and CHO cells. These mutations must cause therefore more 
subtle effects. The possibility that they interfere with the fragmentation of PSl, as was shown recently for 
two other mutations (35, 36), cannot be excluded from the current data. Analysis of brains of transgenic 
mice expressing the PSl Ala-246 Glu mutant indicate, however, that at least this mutation does not 
interfere with the proteolytic processing of PSL- Since all PS mutations known to date act in a dominant 
way in the pathogenesis of Alzheimer's disease, a gain of fiinction is most likely. Evidence that fibroblasts 
derived from patients with presenilin mutations produce more and longer forms of the Pa4 amyloid peptide 
suggests that the mutations exert their effect on cellular metabolism or trafficking of amyloid precursor 
protein (21). The localization of PSl in the early compartments of the biosynthetic pathway makes it 
unlikely that the presenilin mutations directly influence a- and/or ^-secretase activity, since these operate in 
the late-Golgi and transport vesicles, at the cell surface and in endosomes (27, 41-43 ). The possibility that 
PS mutants influence the balance between amyloidogenic and non-amyloidogenic processing of APP in an 
indirect way by changing its intracellular trafficking should now be further explored in polarized 
Madin-Darby canine kidney cells and neurons (16, 44, 45) . 

Our study demonstrates fiirthermore that the presenilins are phosphorylated on serine residues. 
Phosphorylation of PSl and PS2 was evident both in COS-1 cells and CHO cells, but the phosphorylation 
of PSl was less intense and more variable than of PS2. The observation that okadaic acid enhanced the PSl 
phosphorylation suggests that PSl is more prone to phosphatase activity than PS2 (Fig. 8^). Protein kinase 
C is not responsible for PS phosphorylation, since neither stimulation of cells by phorbol esters nor purified 
kinase added to immunoprecipitated PSl or PS2 resulted in increased labeling. It is therefore unlikely that 
the presenilins are directly involved in the regulation of APP secretion by protein kinase C (46, 47). 

Finally, we showed that the hydrophilic NH2-terminal and the major loop domain of PSl are exposed to 

the cytoplasmic side of the endoplasmic reticulum membrane. Digitonin at low concentrations selectively 

permeabilizes the cell membrane, but not the endoplasmic reticulum membrane (29). Antibodies directed 

toward the amino-terminal domain of PSl (Fig. 9) displayed similar immunofluorescent staining patterns in 

digitonin and saponin permeabilized cells. Control experiments with antibodies against a luminal epitope, 

Le. the KDEL endoplasmic reticulum retention signal (30, 3i), showed that digitonin did not permeabilize 

the endoplasmic reticulum membrane. For the major hydrophilic loop domain, essentially identical results 

were obtained. The conclusion therefore is that the two major hydrophilic domains in PSl are oriented to 

the cytoplasmic side of the endoplasmic reticulum, which directs the search for candidate proteins 

interacting with these domains toward the cytoplasm. Consistent with this conclusion, antibodies against 

the second loop domain reacted with PSl after saponin, but not after digitonin permeabilization. This not 

only confirms the luminal localization of the recognized epitope, but also validates the approach as it was 
used. 

While more detailed studies, including electron microscopy and enzymatic digestion protection assays, are 
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required to settle definitively the orientation of the presenilins in the endoplasmic reticulum membrane, our 
results clarify already two important issues. The first problem is the orientation of the first transmembrane 
domain in PSl, which determines in fact the topology of all the following transmembrane domains (46). 
The positive charge difference between the 15 carboxyl-terminal and the 15 amino-terminal residues 
flanking the first membrane spanning domain suggests a luminal orientation of its amino terminus (48). 
However, several exceptions to this rule are known, mainly of proteins of which the amino-terminal 
domain contains more then 17 charged amino acid residues (48). In PSl, this domain contains 29 charged 
residues, which would explain its cytoplasmic orientation. The second issue, which has been pointed out by 
others before (9, 13, 17), is whether two putative hydrophobic stretches in the PS amino acid sequence 
flanking the hydrophilic loop, can additionally span the membrane, which would make the presenilins nine 
transmembrane domain proteins. Our data are compatible with a seven-transmembrane domain model with 
the amino-terminal domain and the hydrophihc loop located in the cytoplasm (Fig. 10; see also Note Added 
in Proof). In conclusion, the current study has analyzed the biosynthesis and the subcellular localization of 
the presenilins and provides a basis for the further study of their cell biology and their possible interactions 
with integral membrane and cytoplasmic proteins such as APP or Tau, both implicated in the pathogenesis 
of Alzheimer's disease (4). 

Fig. 10. Localization of antibody epitopes and orientation of PSl in 
the endoplasmic reticulum membrane. PSl is inserted in the 
endoplasmic reticulum membrane with both its amino-terminal (NH2-) 

and hydrophilic loop domain at the cytoplasmic side. The localization of 
the peptides used to raise the different antibodies used to study the 
topology of PSl are indicated by bars above the sequence (see also "Materials and Methods" and Note 
Added in Proof). 

rview Larger Version of this Image TDK GIF file)] 
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Note Added in Proof El 

An eight-transmembrane model, with hydrophobic region VEQ in the loop domain spanning the 
endoplasmic reticulum membrane, is also compatible with these data. This would result in a cytoplasmic 
orientation of the carboxyl-terminal domain of PSl. 
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A novel cytoplasmic protein-tyrosine phosphatase (PTPase) designated PTP20 was isolated from a PC12 
cDNA library and shown to positively regulate the differentiation process in PC 12 cells. The PTP20 open 
reading frame of 453 amino acids contains a single tyrosine phosphatase catalytic domain and displays 
closest homology to members of the PTP-PEST protein-tyrosine phosphatase family. Transient expression 
of PTP20 in Rat-1 cells resulted in the expression of a 50-kDa protein which exhibited PTPase activity in 
vitro. Expression of the 2.3-kilobase PTP20 mRNA increased during differentiation of nerve growth factor 
(NGF)-stimulated PC 12 cells. Consistent with this observation, stable overexpression of PTP20 in PC 12 
cells resulted in accelerated neurite formation following NGF freatment. These findings suggest a positive 
regulatory role of PTP20 in NGF-dependent neuronal differentiation of PC12 cells. 
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Protein tyrosine phosphorylation has been shown to play a crucial role in regulating fundamental cellular 
processes including proliferation, differentiation, and tumorigenesis Q). Phosphorylation of tyrosine 
residues occurs as a cellular response to a variety of stimuli including growth factors, hormones, or 
cytokines (2). The initial event triggered by these stimuH include an ordered process of activation, 
dimerization, and autophosphorylation of receptor tyrosine kinases. This in turn initiates an intracellular 
signaling cascade which involves the phosphorylation of several other key regulatory proteins. The balance 
of tyrosine phosphorylation is tightly controlled by the opposing activities of protein-tyrosine kinases and 
protein-tyrosine phosphatases (PTPases)- (3). 

Since the recent discovery that the tyrosine kinase Trk (tropomyosin receptor kinase) is the receptor for 
nerve growth factor (NGF) (4, 5, 6), protein tyrosine phosphorylation has been demonstrated to play an 
important role in neuronal differentiation. Upon NGF stimulation, PC 12 cells can differentiate from an 
endocrine cell phenotype to a sympathetic neuronal phenotype (7). The role of tyrosine kinases in the 
differentiation process in PC 12 cells has been partially elucidated and begins when NGF binds to the 
extracellular domain of its cognate receptor. This results in stimulation of tyrosine kinase activity and 
consequent activation of an ordered cascade of phosphorylation events involving p2r''^, Raf-1 kinase, and 
mitogen-activated protein (MAP) kinases (8, 9). 

Although the importance of protein-tyrosine kinases, like Trk, in neuronal differentiation is well 
characterized, the involvement of PTPases in neuronal function and differentiation is still unclear. It has 
been demonstrated that the protein-tyrosine phosphatase inhibitor orthovanadate can inhibit the 
NGF-induced differentiation of PC12 cells, suggesting that some PTPases are involved in the neural 
differentiation process (10). To begin to address the importance of PTPases in neuronal differentiation, we 
attempted to isolate cDNAs which encode PTPases expressed during the differentiation process of PC12 
cells. In this paper, we show the molecular cloning of a novel PTPase, named PTP20, and demonstrate that 
this PTPase might play a positive regulatory role in NGF-induced differentiation of PC12 cells. 



EXPERIMENTAL PROCEDURES O 

PCR Amplification and cDNA Cloning of Rat PTP20 

Degenerate oligonucleotide sense and antisense primers were based on consensus sequences for two highly 
conserved amino acid stretches within the catalytic domains of PTPases: FWAMAW and HCSAG(S/W)G. 
Random-primed cDNA (up to 50 ng) from PC12 cell RNA was used as a template for PCR. Both sense and 
antisense primers were added to a lOO-^il reaction mixture containing 20 mM Tris-HCl (pH 8.4), 50 mM 
KCl, 2.5 mM MgCl2, 0.01% bovine serum albumin, all four dNTPs (each at 200 ^M), 1 unit of 
polymerase (Boehringer Mannheim), and template cDNA. Thirty-five cycles were curried out on a thermal 
cycler; each cycle involved incubation at 94 °C for 1 min, at 42 °C for 1 min, and 72 °C for 1 min. The 
PCR products were separated on a 1.5% agarose gel. Fragments of 350-400 bp were excised, subcloned 
into pBluescript KS(+), and sequenced. The PTP20 PCR fragment was isolated, radioactively labeled by 
random priming, and used to screen 1 x lo^ plaques from a PC 12 cDNA library which had been made 
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using a pool of polyCA)"^ RNA from both undifferentiated and differentiated PCI 2 cells and a AZAPII 
synthesis kit (Stratagene). Hybridization was performed in a solution containing 50% (v/v) formamide, 
5 X SSC, 5 X Denhardt's solution, 0.05 M sodium phosphate, 1 mM NaH2P04, 1 mM Na4P20-7, 0.1 mM 

ATP, 5 ^g of salmon sperm DNA at 42 °C for 20 h. Washing was repeated three times with 2 x SSC, 0.1% 
SDS for 20 min at 42 °C. Six positive clones were obtained and plaque-purified by secondary screening. 
Positive clones were rescued according to the manufacturer's instruction and sequenced in both directions. 

Site-directed Mutagenesis 

The PTP20 mutant containing a cysteine to serine alteration at position 229 was generated using a 
oligonucleotide primer, CTCTGTGTCCACAGCAGTGCTGGCTGT, according to the protocol of Kunkel 

(11) . The mutation was confinned by DNA sequencing. 

Northern Blot Analysis 

Total RNA was extracted from PC 12 cells by acid guanidium isothiocyanate-phenol-chloroform method 

(12 ) . Poly(A)'^ RNA was isolated with oligo(dT)-Sepharose (Stratagene) column chromatography 
according to the manufacturer's instruction. Two micrograms of poly(A)"^ RNA was electrophoresed in a 
formaldehyde, 1.0% agarose gel, blotted to a nitrocellulose membrane filter, and hybridized to -^^P-labeled 
full-length PTP20 cDNA as a probe. 

Transient Expression of PTP20 in Rat-1 Cells 

The insert of PTP20 was cut out with EcoR[ digestion and integrated into an expression vector, pcDNA3 
(Invitrogen), which had been digested with the same restriction enzyme. The direction of the insert in the 
plasmid was confirmed by restriction mapping. Rat-1 cells were transfected with the plasmid (2 |ig/l x 10^ 
cells) by using Lipofectin (Life Technologies, Inc.). After 48 h of culturing, the cells were washed with 
phosphate-buffered saline and then lysed with lysis buffer (50 mM HEPES, pH 7.5, containing 150 mM 
NaCl, 1 mM EDTA, 10% (v/v) glycerol, 1% (v/v) Triton X-100, 1 mM phenylmethylsulfonyl fluoride, 
1 mM sodium orthovanadate, 10 ^g/ml aprotinin). Protein concentrations of cell lysates were measured 
with a protein assay kit (Bio-Rad) using bovine serum albumin as a standard. Equivalent amounts of 
protein were used for Western blot analyses and phosphatase activity assay. 

Stable Expression ofPTPlO in PCI 2 Cells 

Rat pheochromocytoma cells (PCI 2) were cultxired in Dulbecco's modified Eagle's medium containing 
high glucose (4.5 g/liter) supplemented with 10% heat-inactivated horse serum and fetal calf serum. 
5x10^ cells per 60-mm dish were incubated overnight in 4 ml of growth medium. The following day, the 
dish was washed once with serum-free medium and then incubated with a Lipofectin (5 iiil)-DNA (2 ^ig) 
mixture for 6 h. After 48 h, selection started in growth medium containing 500 ng/ml G418 (Life 
Technologies, Inc.). Following 5 weeks of selection, discrete colonies were subcloned and expanded. 

Assay of PTPase Activity 

Phosphatase activity of PTP20 was assayed in a 200-^1 solution containing 25 mM MES, pH 5.5, 1.6 mM 
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dithiothreitol, 10 mM p-nitrophenyl phosphate (pNPP) as a substrate, and 50 of protein cell lysate at 
37 °C for 30 min. The reaction was stopped by the addition of 100 ^1 of 1 N NaOH, and the absorbance 
was measured at 405 nm. 

Western Blot Analyses 

Cells were lysed in lysis buffer (detailed above). To assess PTP20 expression, equivalent amounts of 
protein in the cell lysates were separated by 10% SDS-polyacrylamide gel electrophoresis and 
electrophoretically transferred to nitrocellulose membranes. The membranes were first incubated with 
rabbit anti-PTP-PEST antibodies, and then, as secondary antibody, a peroxidase-coupled goat anti-rabbit 
antibody (Bio-Rad) was added, followed by an enhanced chemiluminescence (ECL) substrate (Amersham) 
reaction. The substrate reaction was detected on a x-ray film (Amersham). The anti-PTP-PEST antibody 
was raised against the carboxyl-terminal 56 amino acids of human PTP-PEST (13) which was expressed as 
a glutathione i'-transferase fusion protein. 



RESULTS AND DISCUSSION D 

In order to identify novel PTPase genes involved in differentiation of PC 12 cells, we performed reversed 
transcription-based PCR amplification using poly(A)"^ RNA fi-om both undifferentiated and differentiated 
PC 12 cells as a template and degenerate primers that were deduced fi-om highly conserved amino acid 
sequences within the PTPase catalytic domain. This reaction gave specific products of 350-400 bp. These 
fi-agments were isolated, cloned, and sequenced. Sequencing analyses revealed the presence of a number of 
different known PTPase clones including LAR (14), PTPa (LRP) (15), RK-PTP (16), PC12-PTP1 (17), 
PTP-IB (14), PTP-2E (18), SHP-2 (PTP-ID) (19), rat homologues of SHP-l (PTP-IC), and MEG2. PCR 
clone number 20 (PTP20) exhibited sequence similarities but was not identical to any previously known 
PTPases. Using the PTP20 PCR-generated cDNA fi-agment as a probe, we isolated a full-length cDNA 
clone fi-om a PC 12 cDNA libniry and characterized it by sequence analysis. 

The cDNA sequence and the deduced amino acid sequence of PTP20 are shown in Fig. i. The 2226-bp 
cDNA clone of PTP20 contained an open reading frame of 1359 bp preceded by 27 base pairs of 
5 -noncoding region and 840 base pairs of 3 -noncoding region. The 3 -noncoding region contained the 
polyadenylation signal sequence AATAAA. The open reading frame encoded a protein of 453 amino acids 
with a predicted molecular mass of approximately 50 kDa. Hydropathy analysis (20) indicated that PTP20 
contained no hydrophobic segments appropriate for signal peptide or transmembrane domains, and, 
therefore, PTP20 is most likely an intracellular protein. The transcripts corresponding to nearly the same 
size of the full-length cDNA were detected in several rat tissues including brain, liver, lung, spleen, skeletal 
muscle, kidney, and testis (not shovra). 



10 



7/7.^/01 VdQ PM 



Z - Aoki et al. 271 (46): 29422 



http://www.jbc.org/cgi/content/fulI/27 1 /46/2942: 



rirrrry^ii^^ • F>g- 1- Nucleotide and deduced amino acid sequences of the cDNA encoding 

L-Z^rr^if '^^ PTP20. Nucleotide and amino acid residues are numbered on the right. 

r^:zy'rjr-^-^j^y^i^ The amino acid consensus sequence (VHCAAGXm) found in all PTPases and 

polyadenylation signal sequences are underlined. PTPase domain is boxed. An 




j; asterisk indicates the stop codon. 



[View Larger Version of this Image ("77K GIF file)] 



The PTPase domain was located near the predicted amino terminus between residues 58 and 283. The 
catalytic domain of PTP20 showed the highest homology with the PTP-PEST family phosphatases, such as 
human and rat PTP-PESTs (H, 2D and PEP-PTP (22). Proline (P), glutamate (E), serine (S), and threonine 
(T) residues are enriched in the PEST-motif sequence, which is not arranged in any specified consensus 
sequence (23). PTP20 may also have a PEST sequence between amino acids 285 and 453, which suggests 
that PTP20 is a member of the PTP-PEST family. In addition, the last 22 amino acids of PTP20 also had 
homology to other PTP-PESTs, and, accordingly, antibodies raised against the human PTP-PEST carboxyl 
terminus cross-react with PTP20. The antibodies were, therefore, used for subsequent experiments as 
described below. 

To confirm that PTP20 encodes a functionally active PTPase, Rat-1 fibroblast cells were transiently 
transfected with mammalian expression constructs encoding either PTP20 or a Cys Ser mutant of 
PTP20. Cell lysates were prepared as described under "Experimental Procedures," and protein 
concentrations were determined. The expression level of both wild type and catalytically inactive mutant 
PTP20 was confirmed by Western blotting with anti-PTP-PEST antibodies. Cross-reactivity with 
nonspecific proteins was not detected as evidenced by lack of signal in the control lane (Fig. 2A)- Nearly 
equivalent amounts of expressed protein were detected. The size of the detected protein was 50 kDa which 
is consistent with the predicted molecular mass of PTP20. For protein-tyrosine phosphatase activity, 
equivalent amounts of protein fi-om the transfected Rat-1 cell lysates were tested using pNPP as a substrate. 
Lysates fi-om transfected cells exhibited an approximately 2.5-fold higher PTPase activity over those fi-om 
control cells, whereas only basal levels of PTPase activity were detected in lysates fi-om cells transfected 
with a construct encoding a catalytically inactive mutant of PTP20 (Fig. IB). These results indicate that 
fiill-length PTP20 cDNA encodes a fimctionally active PTPase. 
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Fig. 2. PTP20 is a functionally active PTPase. A, the expression of PC12-PTP (WT) 

and its catalytic mutant (C/S) was assessed by Western blotting with antibodies 

against human PTP-PEST. All lanes contained equivalent amounts of cell lysate 

(50 ^g). B, phosphatase activity of PTP20 was determined by hydrolysis of the 

PTPase substrate, pNPP. Hydrolysis of pNPP was quantified by measuring the 

absorbance at 405 nm. Cell lysates were prepared from Rat-1 cells which were 

transiently transfected with the expression vector pcDNA3 alone (control), the PTP20 

expression vector (WT), or with an expression vector encoding a catalytic mutant of 

PTP20 (C/S). The data are shown as the average ± S.E. of three independent 
experiments. 



[View Larger Version of this Image (26K GIF file)] 



To elucidate the role of PTP20 in the differentiation process of PC 12 cells. Northern blot analysis was used 
to examine the expression pattern of PTP20 mRNA in PC 12 cells treated with NGF for 3 or 6 days. 
Untreated PC12 cells exhibited a 2.3-kb PTP20 mRNA transcript (Fig. 3). Following 3 days of NGF 
treatment, a 1.5-fold increase in the amount of transcript was observed. Another 3 days of NGF treatment 
caused a 2.4-fold increase as compared to untreated cells. In addition to the predominant 2.3-kb transcript, 
a faint band of 1.5 kb in size was also detected which also increased in abimdance as NGF treatment 
continued. The expression pattern of PTP20 mRNA suggested that PTP20 might play a role during 
NGF-induced PC12 differentiation. 




Fig. 3. Expression of PTP20 in PC12 cells. A, Northem blot analysis was curried out 
with 2 ^ig of poly(A)'*" RNA fi-om PC12 cells treated with NGF (50 ng/ml) for 0, 3, or 
6 days. B, the expression level of the 2.3-kb transcript was quantified with a 
Phosphorlmager (Fuji Fikn). The data are representative of three independent 
experiments. 



[View Larger Version of this Image (28K GIF file)] 



To further elucidate the function of PTP20 in the differentiation process, PC 12 cells were stably transfected 
with the PTP20 cDNA mammalian expression construct. From 7 stably transfected clonal cell lines, three 
independent clones showed high levels of PTP20 expression as assessed by Western blotting (Fig. 4A, 
lanes 2-4), In parental PC 12 cells, endogenous PTP20 protein was beneath detection with the antibody 
(Fig. AA, lane 7). These three independent clones appeared morphologically indistinguishable fi-om parental 
PC 12 cells (Fig. 45). However, following NGF treatment (50 ng/ml), all three clones showed accelerated 
neurite outgrowth, with 20 to 40% of the cells expressing neurites of more than two cell bodies in length at 
day 1 and more than 70% of the cells expressing such neurites at day 3. In contrast, the parental PC 12 cells 
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showed less than 5% of the cells with neurites of two cell bodies in length at day 1 and 47% at day 3. At 
day 4 following NGF treatment, more than 70% of both parental PC 12 cells and PTP-PC 12 cells expressed 
neurite outgrowth; however, the neurite length and the abundance of neurites in PTP-PC 12 cells appeared 
longer and larger than those of parental PC12 cells. 

Fig. 4. Effect of overexpression of PTP20 on the differentiation of PCI 2 
cells induced by NGF. A, expression of PTP20 in PC 12 cells. PC 12 cells were 
stably transfected with PTP20 expression plasmid. Following selection in 
medium supplemented with G418, surviving colonies were subcultured and the 
level of PTP20 was assessed by Western blotting with anti-human PTP-PEST 
antibodies. The transfected cells were named PTP-PC12, and three 
independent clones (PC-YNl, -YN2, and -YN3) were tested. All lanes 
contained equivalent amounts of cell lysate (50 ^ig). Lane 7, PC12; 2, PC-YNl; 
3, PC-YN2; 4, PC-YN3. 5, effects of NGF or EGF treatment on PC12 and 
PTP-PC12 cells (PC-YNl). Cells were cultured in the medium supplemented 
with either NGF (50 ng/ml) or EGF (50 ng/ml), and photographs were taken on the day as indicated. 
Magnification x 100. C, response of PC12 and PTP-PC12 cells to NGF and EGF treatment. At the 
indicated days, 100 to 200 cells were counted and scored for the expression of neurites longer than one cell 
body. PC12; o, PC-YNl; A, PC-YN2; PC-YN3. Upper panel, kinetics of NGF-induced neurite 
outgrovrth in response to NGF (50 ng/ml); middle panel, dose-dependent response to NGF. Cells were 
counted after 3 days of treatment; lower panel, kinetics of EGF-induced neurite outgrov^h in response to 
EGF (50 ng/ml). All data shown here are representative of three independent experiments. 
[View LarRer Version of this Image (72K GIF file)] 



It has been reported that epidermal growth factor (EGF) triggers neuronal differentiation of PC12 cells 
when EGF receptors were overexpressed (24). Hanafusa and co-workers (25) have reported that, when 
overexpressed in PC 12 cells, v-Crk accelerates differentiation induced not only by NGF but also by EGF. 
To test the effect of EGF on PTP-PC 12 cells, the cells were treated with EGF (50 ng/ml). This 
concentration of EGF was chosen because it was consistent with the amounts of EGF used in experiments 
by Hanafusa and co-workers (25). Even after 4 days of EGF treatment, fewer than 5% of the PTP-PC 12 
cells and the parental PC 12 cells exhibited any morphological changes (Fig. 4, 5 and C, respectively). It is 
therefore unlikely that EGF stimulates neurite outgrowth of PTP-PC 12 cells, in contrast to the previous 
observations in PC 12 cells expressing v-Crk or the EGF receptor. 

EGF- and NGF-treated PC 12 cells share several common properties including rapid membrane ruffling, 
cell flattening, and increases cell adhesion (26). However, only NGF promotes neurite outgrowth. This 
difference was originally thought to be mediated by distinct intracellular signaling pathways stimulated by 
NGF (27). Recent studies have demonstrated, however, that the differential effects of NGF and EGF on 
PC 12 cell differentiation maybe defined by differences in the extent and duration of activation of the same 
pathway leading to MAP kinase activation (24, 28, 29). This strongly suggested the critical involvement of 
a phosphatase in this process. In this study, we demonstrated that overexpression of the protein-tyrosine 
phosphatase PTP20 causes accelerated differentiation of PC 12 cells following NGF stimulation. These data 
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suggest an involvement of PTP20 in the regulation of the signal transduction pathway leading to neuronal 
differentiation upon NGF stimulation. The target(s) of the action of PTP20 may therefore represent one or 
more components of the MAP kinase cascade which upon dephosphorylation remove a negative constraint 
and thereby result in a sustained MAP kinase signal. Alternatively, the role of PTP20 may be to negatively 
control cell proliferation and thereby promote differentiation, since preliminary data indicated that 
PTP-PC12 cells proliferate more slowly than parental PC 12 cells in the absence of exogenous growth 
factor addition (data not shown). 

It has been reported that the expression level of some PTPases such as LAR (28), PTP-Pl (29), and 

PC12-PTP1 (17) is up-regulated during NGF-induced differentiation of PC 12 cells. The results presented 

here indicate similar findings for PTP20 and furthermore demonstrated that ectopic overexpression of this 

PTPase promotes the differentiation process in PC 12 cells. This is the first demonstration of a specific 

PTPase to be involved in neuronal differentiation of PC 12 cells. Whether these findings represent a specific 

characteristic of PC 12 cells or reflect an analogous role of PTP20 in the development and maintenance of 
the nervous system remains to be established. 
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Mammalian Son-of-sevenless (mSos) functions as a 
guanine nucleotide exchange factor for Ras and Rac, 
thus regulating signaling to mitogen-activated protein 
kinases and actin djniamics. In the current study, we 
have identified a new mSos-binding protein of 50 kDa 
(p50) that interacts with the mSosl proline-rich domain. 
Mass spectrometry analysis and immunodepletion stud- 
ies reveal p50 as PACSIN 1/syndapin I, a Src homology 3 
domain-containing protein functioning in endocytosis 
and regulation of actin dynamics. In addition to PACSIN 
1, which is neuron-specific, mSos also interacts with 
PACSIN 2, which is expressed in neuronal and nonneu- 
ronal tissues. PACSIN 2 shows enhanced binding to the 
mSos proline-rich domain in pull-down assays from 
brain extracts as compared with lung extracts, suggest- 
ing a tissue-specific regulation of the interaction. Pro- 
line to leucine mutations within the Src homology 3 
domains of PACSIN 1 and 2 abolish their binding to 
mSos, demonstrating the specificity of the interactions. 
In situj PACSIN 1 and mSosl are co-expressed in growth 
cones and actin-rich filopodia in hippocampal and 
dorsal root ganglion neurons, and the two proteins co- 
immunoprecipitate from brain extracts. Moreover, epi- 
dermal growth factor treatment of COS-7 cells causes 
co-localization of PACSIN 1 and mSosl in actin-rich 
membrane ruffies, and their interaction is regulated 
through epidermal growth factor-stimulated mSosl 
phosphorylation. These data suggest that PACSINs may 
function with mSosl in regulation of actin dynamics. 



* This work was supported by Canadian Institutes of Health Re- 
search Operating Grant MT-15396 (to P. S. M.) and the KSln Fortune 
Program of the Medical Faculty of the University of Cologne (to M. P.). 
The costs of publication of this article were defrayed in part by the 
payment of page charges. This article must therefore be hereby marked 
"advertisement"* in accordance with 18 U.S.C. Section 1734 solely to 
indicate this fact. 

§ Supported by studentships from the Natural Sciences and Engi- 
neering Research Council and from Fonds de Recherche en Santa du 
Quebec. 

II Supported by National Institutes of Health, NINDS Grant NS22807 
(to Dr. Susan Hockfield of Yale University). 

§§ Present address: Dept. of Life Sciences, Ben-Gurion University, 
Beer Sheva 84105, Israel. 

^11 Supported by an Investigator Award from the Canadian Institutes 
of Health Research. To whom correspondence should be addressed: 
Dept. of Neurology and Neurosurgery, Montreal Neurological Institute, 
McGill University, 3801 rue University, Montreal H3A 2B4, Quebec, 
Canada. Tel.: 514-398-7355; Fax: 514-398-8106; E-mail: mcpm@ 
musica.mcgillca. 



Ras functions as a molecular switch in the transduction of a 
wide variety of growth and differentiation signals induced by 
extracellular ligands (1). Activation of Ras is mediated by sev- 
eral Ras-specific guanine nucleotide exchange factors (GEFs)^ 
that convert GDP-Ras into GTP-Ras (1). Prominent among 
these is mammalian Son-of-sevenless (mSos). mSos interacts 
through a C-terminal proline-rich domain (PRD) with the Src 
homology 3 (SH3) domains of Grb2, an adaptor protein that 
targets mSos to activated growth factor receptors (2-7). Addi- 
tionally, mSos interacts through the PRD vrith the endocytic 
adaptor proteins amphiphysin II (8) and intersectin (9, 10). 
These interactions may function to target mSos to Ras activa- 
tion on the endocytic pathway (11-13), 

In addition to the PRD, mSos contains a CDC25 homology 
domain, encoding Ras GEF activity (3-5, 14), and a Dbl homol- 
ogy domain, endowed with GEF activity for Rac, a member of 
the Rho superfamily of GTPases (15, 16). In fact, mSos is the 
prototype member of a family of bifunctional GEFs, including 
Ras-GRFl and Ras GRF-2, having dual specificity for Ras and 
Rac (16). Through its Dbl homology domain, mSos binds di- 
rectly to Rac (17). However, the GEF activity of mSos toward 
Rac appears to be unique relative to other Rho family GEFs in 
that it catalyzes guanine nucleotide exchange as part of a 
macromolecular complex with Eps8 and E3bl, two proteins 
fimctioning in growth factor signaling (18, 19). Rac activation 
has multiple effects in cells, the most prominent being alter- 
ations in the actin cytoskeleton leading to membrane ruffling 
and lamellipodia formation (20). Thus mSos, through its ability 
to activate Rac, is thought to play a functional role in groAvth 
factor-mediated regulation of actin dynamics (18, 21). 

To screen for novel mSos binding partners, we performed 
overlay assays of brain extracts with fusion proteins encoding 
the PRD of mSosl. A major mSosl-binding protein of 50 kDa 
was detected and purified by affinity chromatography. Mass 
spectrometry analysis identified the protein as PACSIN 1/syn- 
dapin L PACSIN 1 was originally identified based on its dif- 
ferential expression in intact and lesioned mouse brain (22), 
and syndapin I was independently identified through its SH3 
domain-dependent interaction with dynamin 1 (23). We will 



^ The abbreviations used are: GEF, guanine nucleotide exchange 
factor; CHAPS, 3-[(3-cholamidopropyl)dimethylammoniol-l-propane- 
sulfonic acid; CT, C-terminal; EGF, epidermal growth factor; GST, 
glutathione S- transferase; MEK, mitogen-activated protein kinase/ex- 
tracellular signal-regulated kinase kinase; mSos, mammalian son-of- 
sevenless; NT, N-terminal; PAGE, polyacryl amide gel electrophoresis; 
PBS, phosphate-buffered saline; PRD, proline-rich domain; SH, Src 
homology. 
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use the name PACSIN throughout to collectively refer to PAC- 
SIN and syndapin. Whereas PACSIN 1 is neuron-specific, its 
closely related homologue PACSIN 2 is expressed in brain and 
several nonneuronal tissues (24, 25). Interestingly, the 
PACSINs appear to be involved in regulation of endocytosis 
and the actin cytoskeleton. Through a C-terminal SH3 domain, 
the PACSIN isoforms interact with the endocytic regulatory 
enzymes djmamin 1 and synaptojanin 1, as well as with N- 
WASP, a stimulator of Arp2/3-mediated actin nucleation and 
assembly (23, 25, 26). Overexpression of full-length PACSIN 
stimulates cortical actin assembly, leading to filopodia forma- 
tion, and the PACSINs localize to sites of high actin turnover, 
such as filopodia and lameUipodia (25). 

After identifying PACSIN 1 as a mSosl binding partner, we 
confirmed the interaction in vitro and used co-immunoprecipi- 
tation analysis to demonstrate the interaction in vivo. Inter- 
estingly, mSosl co-distributes with PACSIN 1 in the growth 
cones and filopodia of cultured hippocampal neurons, and both 
proteins co-localize vnth actin in the filopodia from growth 
cones of dorsal root ganglia neurons in culture. Further, 
PACSIN 1 and mSosl are co-localized in growth factor-induced 
membrane ruffles in COS-7 cells, and their interaction is reg- 
ulated by mSosl phosphorylation. Together, these data provide 
further evidence for a role for mSos in regulation of the actin 
cytoskeleton. 

EXPERIMENTAL PROCEDURES 
Ann'6orfres— Afnnity-purined antibodies against PACSIN 2 (26) and 
amphiphysin I and II (27) were described previously. Polyclonal anti- 
serum 2704 against rat syndapin I was a generous gift of Dr. Regis 
Kelly (University of California, San Francisco) (23). Monoclonal anti- 
bodies against a-tubulin and FLAG epitope (Sigma), tetra-His epitope 
(Qiagen). dynamin 1 (HUDY-1) (Upstate Biotechnology Inc.), and a 
polyclonal antibody C23 against mSosl (Santa Cruz Biotechnology) 
were obtained commercially. 

DNA Constructs and Recombinant Proteins— Hisg-tagged rat syn- 
dapin I (25) and Flag-tagged mouse Sosl (28) in mammalian expression 
vectors were generous gifts of 0r. Regis Kelly (University of California, 
San Francisco) and Dr. Jeffrey Pessin (University of Iowa), respectively! 
A protein construct encoding the SH3 domain of rat syndapin I (resi- 
dues 376-441) was generated by polymerase chain reaction with Vent 
DNA polymerase (New England Biolabs) using full-length cDNA as 
template and the following primers: forward, 5'-CGCCTCGAGCG- 
GATCCAACCCCTTCGAGGACGATGC-3'; reverse, 5'-CGGAATTC- 
CTATATAGCCTCAACGTAGTTG-a'. The resulting polymerase chain 
reaction product was digested with Bamlil and EcoBl and cloned in- 
frame into the corresponding sites of pGEX-2T. Mouse Sosl cDNA was 
used as a template to generate the following GST fiision proteins- 
GST-NT (residues 1111-1228) and GST-CT (residues 1223-1341). 
GST-NT was generated with the forward primer 5'-GCGGATCCTCT- 
GGCACCTCCAGCAAC-3' and the reverse primer 5'-GCGGAATTCT- 
CAATCAGGTGTCCTCACAGG-3'. GST-CT was generated with the for- 
ward primer 5'-GCGGGATCCCCTGTGAGGACACCTGATG-3' and the 
reverse primer S'-GCGGAATTCTCAGGAAGAATCGGCATPC-S', The 
resulting polymerase chain reaction products were digested with 
BamYLl and EcoRI and cloned in-frame into the corresponding sites of 
pGEX-2T. GST fusion proteins encoding full-length mouse PACSIN 1 
and 2 were prepared as described (26). PACSIN isoforms containing 
single amino acid changes in the SH3 domains were derived using the 
mutation oligonucleotides P1-P434L (5'-GGCCTCTATCTCGCGAAC- 
TACG'ITG.3') for PACSIN 1 and P2-P478L (5'-GGCCTATACCTCGC- 
GAACTATGTCG.3') for PACSIN 2 on the corresponding wild-type 
cDNAs in combination with the Transformer™ site-directed mutaeen- 
esis kit (CLONTECH). 

Tissue and Subcellular Fractionation^V arions adult rat tissues, 
including brain, were homogenized in Buffer A (10 mM HEPES-OH, pH 
7.4, 0.83 mM benzamidine, 0.23 mM phenylm ethyls ulfonyl fluoride,* 0.5 
/ig/ml aprotinin, and 0.5 ^ml leupeptin). A postnuclear supernatant 
was obtained by centriftigation for 5 min at 800 x g^, and the extracts 
were then separated into cytosolic and membrane fractions by ultra- 
centrifugation at 205 000 X for 30 min at 4 •C. In some cases, the 
postnuclear supernatant was incubated with 1% Triton X-100 for 30 
min prior to ultracentrifugation, leading to a crude Triton-soluble ly- 
sate. Differential centrifugation of rat brain extracts, leading to the 



defined subcellular fractions in Fig. 2, was performed as described 
previously (29). 

Overlay Assays— Overlay assays with GST fusion proteins were per- 
formed as described (30). Briefly, protein fractions were resolved by 
SDS-PAGE and transferred to nitrocellulose. Membranes were blocked 
in 5% nonfat dry milk powder in phosphate- bufTered saline (PBS) (20 
mM NaHgPO^, 0.9% NaCl, pH 7.4) for 1 h and incubated overnight at 
4 "C with 10 or 20 fig of GST fusion protein diluted in Tris-buffered 
saline (20 mM Tris-Cl, 150 mM NaCl, pH 7.4) containing 3% bovine 
serum albumin, 0.1% Tween-20, and 1 mM dithiothreitol. Bound fusion 
protein was subsequently detected using affinity-purified antibodies 
directed against GST. 

Affinity Chromatography—CytosoUc or crude Triton-soluble lysates 
were prepared from different tissues as described above. For the cyto- 
solic samples, Triton X-100 was added to 1%. The samples were then 
incubated for 2 h or overnight at 4 *C with GST fusion proteins pre- 
coupled to glutathione-Sepharose beads. After incubation, samples 
were washed three times in Buffer A containing 1% Triton X-100, and 
bound proteins were resolved by SDS-PAGE and processed for Western 
blot analysis or stained with Coomassie Blue, For precipitation exper- 
iments with full-length fusion proteins of wild-type and mutant 
PACSINs, mouse brains were homogenized in Buffer B (10 mM HEPES, 
pH 7.4, 150 mM NaCl, 1 mM EGTA, 0.1 mM MgCIg,) containing 1% 
CHAPS and a protease inhibitor mixture (Sigma). The homogenates 
were centrifuged for 30 min at 21,000 X g, the supernatant was de- 
canted and recentrifuged, and Triton X-100 was added to the resulting 
supernatant at a final concentration of 0.05%. The preparation was 
dialyzed overnight against Buffer B and centrifuged as before. The 
resulting supernatant was incubated overnight at 4 "C with GST-PAC- 
SlNs precbupled to glutathione-Sepharose. The beads were subse- 
quently washed extensively in Buffer B containing 0.1% Triton X-100, 
and bound proteins were resolved by SDS-PAGE and processed for 
Western blot analysis. For phosphorylation experiments, COS-7 cells 
were serum-starved overnight and preincubated with 50 /im PD-098059 
(Cajbiochem, La Jolla, CA) in MegSO or MeaSO alone for 30 min at 
37 *C. Cells were then treated with 100 ng/ml epidermal growth factor 
(EGF) for 5 min at 37 ^C in the absence or presence of PD-098059 and 
lysed in ice-cold Buffer C (10 mM HEPES-OH, pH 7.4, 5 mM EGTA, 5 
mM EDTA, 50 mM sodium fluoride, 20 mM sodium pyrophosphate, 1 mM 
sodium vanadate, 0.83 mM benzamidine, 0.23 mM phenylmethylsulfonyl 
fluoride, 0.5 /ig/ml aprotinin, and 0.5 /ig/ml leupeptin). Cell lysates 
were solubilized with 1% Triton X-100 and centrifuged in a Beckman 
TLA 100.2 rotor at 75,000 rpm for 15 min. Cell extracts were incubated 
for 1 h at 4 "C with GST fusion proteins (-5 /ig/ml of each protein) 
prebound to glutathione-Sepharose beads. After incubation, samples 
were washed three times in Buffer C containing 1% Triton X-100, and 
bound proteins were resolved by SDS-PAGE and processed for Western 
blot analysis. 

Immunoprecipitation Ana/ysis— Cytosolic or crude Triton-soluble ly- 
sates were prepared from different tissues as described above. For the 
cytosolic samples, Triton X-100 was added to 1%. The sample were then 
precleared by incubation with protein A-Sepharose, and an aliquot of 
the precleared extract was incubated overnight at 4 *C with normal 
rabbit serum or different antibodies precoupled to protein A-Sepharose. 
Beads were washed in Buffer A containing 1% Triton X-100, and pro- 
teins specifically boimd to the beads were eluted and processed for 
SDS-PAGE. For immunodepletion experiments, an identical protocol 
was used except that 100 /tg of cytosolic extract was added to the beads 
and the material that did not bind to the beads was processed for 
SDS-PAGE. 

Primary Cell Culture — Dissociated cell cultures were prepared from 
the CA3 and dentate regions of hippocampi from PI rat pups as de- 
scribed (31). Following 2 days in culture, the hippocampal neurons were 
fixed with PBS containing 4% paraformaldehyde and 4% sucrose for 15 
min at room temperature. The ceils were permeabilized with 0.1% 
Triton X-100 for 5 min and blocked with PBS containing 1% normal 
goat serum. For explant cultures, dorsal root ganglia were dissected 
from E15 Harlan Sprague-Dawley rats. The ganglia were cut in half 
and placed on coverslips coated with 10 /tg/ml natural mouse laminin 
(life Technologies, Inc.). The cultures were incubated in F-12 culture 
medium containing 10% fetal calf serum supplemented with 50 ng/ml 
7S nerve growth factor. Following 1-2 days in culture, the cells were 
fixed with PBS containing 3% paraformaldehyde and 0.3 M sucrose for 
30 min at room temperature. The cells were permeabilized with 0.2% 
Triton X-100 for 3 min and blocked with PBS containing 5% bovine 
serum albumin and 5% normal goat serum. Following blocking, both 
culture types were processed for immunofluorescence with antibodies 
2704 and C23. In some cases, filamentous actin was detected with 



26624 



Sos Binds PACSIN / Syndapin 



A 



68- 

— -p50 

45- 




120- W^PW ^amphl 

SM V B SM V B SNt V B 
GST GST-NT GST-CT 
Sos-PRC) 

Fig, 1. Overlay analysis of mSosl -binding proteins. A, proteins 
of cytosolic fractions from rat brain were separated by SDS-PAGE, 
transferred to nitrocellulose, and overlaid with GST fused to amino 
acids 1111-1228 (GST-NT) or amino acids 1223-1341 {GST-CT) of the 
PRD of mouse Sosl or with GST alone. The migratory positions of three 
major bands that bind to GST-NT and that of the molecular weight 
standards are indicated on the right and left, respectively. B, a crude 
Triton X-lOO-soluble rat brain extract was incubated with GST, GST- 
NT. or GST-CT conjugated to glutathione-Sepharose beads. Proteins 
specifically boimd to the beads (B) along with aliquots of the brain 
extract (starting material (SAf)) and equal amounts of the unbound 
material (void (V)) were processed for Western blot with an antibody 
that recognizes both amphiphysin I (amph f) and amphiphysin II {amph 
ID. The molecular masses of amphiphysin I and II are indicated on the 
left. 

phalloidin-Alexa 488 (Jackson ImmunoResearch). 

Immunofluorescence on COS -7 Cells — COS-7 cells were plated on 
poly-L-lysine-coated coverslips, trans fected with LipofectAMINE 2000 
(Life Technologies, Inc.), serum-starved overnight, and then left un- 
treated or stimulated with 100 ng/ml EGF for 2 min at 37 *C. Cells were 
washed twice in ice-cold PBS and processed for immunofluorescence as 
previously described (32) using antibodies 2704 or C23. Filamentous 
actin was detected with phalloidin-tetramethylrhodamine isothiocya- 
nate (Sigma). 

RESULTS AND DISCUSSION 

We previously demonstrated that mSos interacts through its 
PRD with the endocj^ic protein intersectin, suggesting that 
intersectin may target mSos to Has on the endocytic pathway 
(9, 10, 13). As the PRD of mSos contains multiple SH3 domain- 
binding consensus sites, we sought to identify additional mSos 
binding partners. Overlay of adult rat brain extracts with a 
GST fusion protein encoding the N-terminal half of the mouse 
Sosl PRD (GST-NT) (amino adds 1111-1228) identified three 
proteins of 120 (pl20), 90 (p90), and 50 (p50) kDa (Fig. lA). 
None of the bands were detected with a GST fusion protein 
encoding the mouse Sosl PRD C-terminal half (GST-CT) (ami- 
no acids 1223-1341) or with GST alone (Fig. LA). Previously, 
Leprince et at. (8) identified amphiphysin 11 as a mSosl binding 
partner. To determine whether pl20 and p90 correspond to 
amphiphysin I (120 kDa) and amphiphysin II (90 kDa), respec- 
tively, we used GST-NT, GST-CT, or GST alone in pull-down 
assays with soluble rat brain extracts. Western blots of the 
pull-downs demonstrated that both amphiphysin I and II bind 
specifically to GST-NT (Fig. LB), suggesting that they repre- 
sent pl20 and p90. Intersectin also bound selectively to GST- 
NT, whereas Grb2 bound equally well to GST-NT and GST-CT 
(data not shown). Surprisingly, neither Grb2 nor intersectin 
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Fig. 2. Tissue and subcellular distribution of p50. A, proteins of 
crude Triton X-lOO-soluble extracts from various adult rat tissues (200 
fig/tissue) were separated by SDS-PAGE, transferred to nitrocellulose, 
and overlaid with GST-NT. B, proteins of brain subcellular fractions 
(100 ^fraction) were separated by SDS-PAGE, transferred to nitro- 
cellulose, and overlaid with GST-NT. Subcellular fractions were pre- 
pared as described (28). H, homogenate; P, pellet; S, supernatant; LP, 
lysed pellet; LS, lysed supernatant. For both A and B, the migratory 
position of p50 is indicated by the arrow on the right. 

was detected on the overlay assays, possibly due to lower levels 
of expression in brain extracts than the amphiphysins or p50. 
The abundant SH3 domain-containing protein, endophihn 1, 
which is readily detected on overlays with the PRDs of synap- 
tojanin 1 (33) and dynamin 1 (34), was not seen on the overlays 
with the PRD of mSosl, further demonstrating the specificity 
of the interactions detected. 

To characterize p50, the major mSosl binding partner iden- 
tified, we performed overlays with GST-NT on tissue extracts. 
p50 was detected in brain but not in a variety of nonneuronal 
tissues (Fig. 2A). This is consistent with the distribution of 
mSosl that is expressed at higher levels in brain than in other 
tissues (9). Within brain, subcellular fractionation revealed p50 
in both soluble and particulate fractions (Fig. 2B). The greatest 
enrichment was seen in the second lysed supernatant fraction 
(Fig. 2B, LSzX which contained soluble proteins generated from 
the lysis of crude s3niaptosomes. This distribution is similar to 
that previously described for mSosl (9), as well as that of the 
presynaptically enriched endocjrtic regulatory enzymes dy- 
namin 1 and synaptoj£uiin 1 (29). 

To identify p50, we used the mSosl GST fusion proteins to 
affinity piuify mSosl-binding proteins from a soluble rat brain 
extract. As determined by Coomassie Blue staining, a 50-kDa 
band that bound to GST-NT but not to GST-CT or to GST alone 
was the major affinity-selected protein (Fig. 3). Minor bands at 
120, 90, and 70 kDa were also weakly detected. The 50-kDa 
band was excised from the gel and subjected to trypsin diges- 
tion, and the fragments were analyzed by matrix assisted laser 
desorption ionization mass spectrometry at the W. M. Keck 
Foundation Biotechnology Resource Laboratory at Yale Uni- 
versity. A ProFound search of the peptide masses provided a 
tentative identification for p50 as PACSIN 1/syndapin I. PAC- 
SIN 1 was identified based on its up-regulation during neuro- 
nal differentiation in mouse (22), whereas its rat orthologue 
syndapin I was identified through its SH3 domain-dependent 
interaction with dynamin 1 (23). PACSIN 1, which contains an 
SH3 domain at its C terminus, is a neuron-specific protein with 
a predicted molecular mass of 50 kDa, consistent with its 
identification as p50. To support this identification, we per- 
formed overlay assays with GST-NT or GST alone on cells 
transfected with a cDNA encoding Hiss-tagged full-length 
PACSIN 1. GST-NT specifically interacted with a protein spe- 
cies that perfectly co-migrated with PACSIN 1, as detected 
with an anti-Hisg Western blot (Fig. 4A), demonstrating that 
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Fig. 3. Affinity purification of p50. A rat brain cytosolic extract 
was incubated with GST, GST-NT, or GST-CT conjugated to glutathi- 
one-Sepharose beads. Proteins specifically bound to the beads [B) along 
with aliquots of the soluble brain extract (starting material (SAO) and 
equal amounts of the unbound material (void (V)) were subjected to 
SDS-PAGE and detected by Coomassie Blue staining. The migratory 
position of p50 (asterisk) as well as the mSosl GST fusion proteins and 
GST alone are indicated on the right. The migratory positions of the 
molecular weight standards are indicated on the left. 

PACSIN 1 directly interacts with mSosl. The identity of p50 as 
PACSIN 1 was confirmed with the demonstration that immu- 
nodepletion of PACSIN 1 from brain extracts using an anti- 
PACSIN 1 antibody completely depletes p50, as determined by 
GST-NT overlay (Fig. 

In addition to PACSIN 1, a second member of the PACSIN 
family, referred to as PACSIN 2, has been recently described 
(24, 25). In contrast to PACSIN 1, which is expressed exclu- 
sively in neurons, PACSIN 2 is expressed in brain and nonneu- 
ronal tissues (24, 25) (Fig. 5A). Because mSosl is enriched in 
brain but is also expressed in nonneuronal tissues (9), we 
hypothesized that mSos-PACSIN 2 interactions may occur both 
within and outside the nervous system. To explore this ques- 
tion, we performed pull-down assays from Triton X-lOO-solubi- 
lized extracts prepared from brain, as well as from lung, which 
expresses high level of PACSIN 2 (Fig. 5A) (26). As expected, 
we detected binding of PACSIN 2 from both tissues to the 
mSosl PRD (Fig. 5B). In fact, PACSIN 2 is likely to represent 
the 70-kDa band that was weakly detectable on the Coomassie 
blue stained pull-downs from brain extracts using mSos 
GST-NT (Fig. 3). Surprisingly, the level of PACSIN 2 recovered 
on the mSosl-PRD fusion protein was consistently greater 
when using brain versus lung extracts, even though PACSIN 2 
was more abundant in the extracts from lung (Fig. 5, A and B). 
Moreover, a second, slightly smaller band that reacted with the 
PACSIN 2 antibody was detected in the GST-NT pull-downs 
from brain extracts but not from lung extracts (Fig. dB). This 
protein may represent the short splice variant of PACSIN 2 
previously described in rat tissues (25). Comparable amounts 
of the 70-kDa protein were pulled down from lung and brain 
extracts with an anti-PACSIN 2 antibody, suggesting that 
PACSIN 2 is equally accessible in both tissues (Fig. 55). Thus, 
the differential binding of the long form of PACSIN 2 to the 
mSosl PRD in brain compared with lung reveals a tissue- 
specific regulation of the interaction. The reason for this obser- 
vation is cturently unknown. However, it is possible that a 
tissue-specific posttranslational modification of PACSIN 2 al- 
ters its affinity for the mSos PRD. 
To address whether the observed interactions are dependent 
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Fig. 4. Confinnation of p50 as PACSIN 1. A, cytosolic fractions of 
HEK-293 cells transfected with vector alone (mock) or with Hise-tagged 
PACSIN 1 (PACSIN 1) were resolved by SDS-PAGE and processed for 
Western blot with an anti-HiSg antibody (a-His) or were overlaid with 
GST or GST-NT. The migratory positions of Hisg-PACSIN 1 and the 
molecular weight standards are shown on the right and left, respec- 
tively. B, an anti-PACSIN I serum (PACSIN 1) and normal rabbit 
serum (NRS) were coupled to protein A-Sepharose beads and incubated 
with a rat brain cytosolic extract. The unbound material (void ofLp,), 
along with an aliquot of the cytosolic extract (starting material (SM)) 
were processed for Western blot with the anti-PACSIN I antibody or 
were overlaid with GST-NT. 



on the classical SH3 domain binding interface, we generated 
point mutations in the SH3 domains of PACSIN 1 and 2 that 
converted proline to leucine (P434L for PACSIN 1; P478L for 
PACSIN 2). Comparable mutations in the Caenorhabditis el- 
egans Grb2 homologue sem-5 cause a lethal phenotype by pre- 
venting scm-5 interactions with its PRD-containing binding 
partners (35). Using wild-type and mutated PACSIN 1 and 2 
expressed as GST fusion proteins, we performed puU-down 
assays from brain extracts (Fig. 50). mSosl was foimd to in- 
teract with both wild-type fusion proteins, whereas the proline 
to leucine mutations aboHshed mSosl binding to both 
PACSINs, demonstrating that the interactions are specifically 
mediated through the PACSIN SH3 domains. 

To explore the potential interaction between PACSIN 1 and 
mSosl in situ, we performed co-immunoprecipitation experi- 
ments from rat brain extracts. Immunoprecipitation of 
PACSIN 1 led to co-immunoprecipitation of mSosl (Fig. 6). The 
interaction was specific, as no mSosl precipitated in the pres- 
ence of normal rabbit serum, and the abundant brain protein 
tubuhn was not detected in the anti-PACSIN 1 immunoprecipi- 
tates (Fig. 6). Only a limited percentage of the total mSosl in 
the brain extract co-immunoprecipitated with PACSIN 1. This 
is not surprising given that PACSIN 1 interacts through, its 
SH3 domain with multiple binding partners, including the 
abundant brain proteins dynamin 1 and synaptojanin 1 (23), In 
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' Fig. 5. Specific interaction of mSosl with the SH3 domains of 
PACSIN 1 and 2. A, proteins of crude Triton-soluble extracts from 
various adult rat tissues (200 ^tissue) were separated by SDS-PAGE, 
transferred to nitrocellulose, and processed for Western blot using 
antibodies against PACSIN 2. B, GST-NT conjugated to glutathione- 
Sepharose beads (GST-NT) and anti-PACSIN 2 antiserum precoupled 
to protein A-Sepharose beads (PACSIN 2 tp.) were incubated with 
crude Triton-soluble extracts from rat lung and brain. Proteins specif- 
ically bound to the beads (bead) along with aliquots of the lung and 
brain extract (starting material (SM)) were subjected to SDS-PAGE and 
processed for Western blot using antibodies against PACSIN 2. C, a 
. mouse brain extract was incubated with glutathione-Sepharose beads 
coiyugated to GST alone or GST fused to full-length wild-type (wt) 
PACSINs or PACSINs with point mutations in their SH3 domains 
(PACSIN 1, proline 434 mutated to leucine (P434L); PACSIN 2, proline 
478 mutated to leucine (P478L)). Material specifically bound to the 
beads was resolved by SDS-PAGE along with an aliquot of the brain 
extract (starting material (SM)) and processed for Western blot using 
antibodies against mSosl. 

fact, dynamin 1 was found to strongly co-immunoprecipitate 
with PACSIN 1 (Fig. 6). As the interactions between PACSIN 1 
and its various SH3 domain-binding partners are likely to be 
competitive, the PACSIN-mSos interaction may be restricted to 
specific subcellular domains that are enriched for mSosl rela- 
tive to other PACSIN binding partners. Alternatively, the 
PACSINs may be at the core of large protein complexes in 
which they simultaneously interact with multiple binding part- 
ners. Consistent with the later possibility, it has been recently 
demonstrated that the PACSINs can self-associate to form 
homo- and hetero-oligomers (26). 

r-: To further examine the potential for interactions between 
PACSIN 1 and mSosl in situ, we sought to determine whether 
the proteins were co-distributed in neurons. Immvmofluores- 
cence analysis of hippocampal neurons at 2 days in vitro with 
polyclonal antibodies against each protein revealed strong 
staining in the neuronal cell bodies, with fluorescent punctae 
observed along the length of the neurites and in grow^ cones 
(Pig. 7, A and O. Staining for both proteins extended into 
filopodia emanating from the growth cones (Fig. 7, B and D). To 
examine the localization within growth cones in more detail, we 
performed immunofluorescence analysis of primary rat dorsal 
root ganglia neiu-ons maintained in culture for 1 day. Similar to 
hippocampal neurons, . both PACSIN 1 and mSosl were de- 



i.p. 

Fig. 6. mSosl interacts with PACSIN 1 in vivo. Anti-PACSIN 1 
antiserum (PACSIN I) and normal rabbit serum (NRS), precoupled to 
protein A-Seph arose beads, were incubated with soluble extracts from 
rat brain. Material specifically bound to the beads (immunoprecipita- 
tion, i.p.) was resolved by SDS-PAGE along with an aliquot of the brain 
extract (starting material (SM)) and processed for Western blot using 
antibodies against mSosl, dynamin 1, tubulin, and PACSIN 1 as 
indicated. 




Fig. 7. Co-expression of PACSIN 1 and mSosl in growth cones 
and filopodia of cultured hippocampal neurons. Confocal images 
of CAS hippocampal neurons at 2 days in vitro stained for mSosl (A and 
B) and PACSIN 1 (C and D), Arrows in A and C indicate growth cones. 
A higher magnification image of the growth cone reveals punctate 
staining along the neurites and filopodia (arrows in B and D). Scale bar, 
10 /xm in (A and O and 1.5 ^m in B and D. 

tected in dorsal root ganglia growth cones and were seen to 
extend into filopodia (Fig. 8). Interestingly, co-staining with 
phalloidin, which reveals filamentous actin, demonstrated that 
both mSosl (Fig. 8A) and PACSIN 1 (Fig. 8B) were strongly 
co-localized with actin filaments at the plasma membrane and 
throughout the length of the filopodia. PACSIN has been dem- 
onstrated to localize at sites of high actin turnover, including 
filopodia and filopodial tips in nonneuronal cells (25). Further, 
overexpression of full-length PACSIN causes filopodia forma- 
tion in an N-WASP-dependent manner, although the mecha- 
nism of this activation is unknown (25). The co-localization of 
mSosl with PACSIN 1 in filopodia suggests that mSosl may 
cooperate with PACSIN 1 in filopodia formation or function. 
This role may be particularly relevant during neuronal devel- 
opment, as actin-dependent filopodial dynamics are critical in 
the response of neuronal growth cones to extracellular guid- 
ance cues (36). Filopodia formation is dependent . on 
N-WASP (37), which is activated by interactions with SH3 
domains (38), as well as by binding to phosphatidylinositol 
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Fig. 8. Co-locaUzation of mSosl and PACSIN 1 with actin in 
growth cones and filopodia of dorsal root ganglia neurons in 
culture. Growth cones of E16 dorsal root ganglia neurons from an 
explant culture stained with an antibody against mSosl iA) or PACSIN 
1 (B) and co-stained with phalloidin-Alexa 488 (phalloidin) to demon- 
strate F-actin. Arrows point to filopodia demonstrating co-localizat- 
ion of mSosl or PACSIN with actin. Scale bar, 5 in low power (top 
three panels of A and B) and 2 /im in high power (bottom three panels in 
A and B). 

(4,5)bisphosphate and GTP-bound Cdc42 (39, 40). Thus, 
PACSIN may cause filopodia formation via SH3 domain-de- 
pendent stimulation of N-WASP. As activated Rac binds to 
phosphatidylinositol 4-phosphate 5-kinase, leading to phos- 
phatidylinositol (4,5)bisphosphate production (41, 42), mSos 
could contribute to N-WASP stimulation via activation of Rac 
(17-19, 21). 

We next investigated the possibility of direct co-localization 
of PACSIN 1 and mSosl in actin-rich structures. Fibroblasts 
are well established to form membrane ruffles upon treatment 
with growth factors (43, 44). We therefore co-transfected COS-7 
cells with FLAG-tagged mSosl and Hisg-tagged PACSIN 1 and 
examined their distribution before and after EOF treatment. 
Interestingly, both PACSIN 1 and mSosl relocahzed from a 
predominantly cytoplasmic distribution to become concen- 
trated and co-localized at membrane ruffles following EOF 
treatment (Fig. 9A). Co-staining of PACSIN 1 transfected cells 
with anti-PACSIN 1 antibody and fluorescent phalloidin con- 
firmed that the structures at which PACSIN 1 and mSosl were 
co-localized were actin-rich membrane ruffles (Fig. 9B). 
Growth factor-induced ruffle formation is mediated by Ras-de- 
pendent Rac activation (43). Recent data suggest that mSos 
plays an important dual role in coupling Ras to Rac (18). 
Through its CDC25 homology domain, mSos activates Ras, 
which in turn activates phosphatidylinositol 3-phosphate ki- 
nase (45). The products of phosphatidylinositol 3-phosphate 
kinase catalytic activity stimulate the GEF activity of mSos 
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Fig. 9. EOF treatment stimulates PACSIN 1/mSosl co-localiza- 
tion in membrane ruffles. COS-7 cells, co-transfected with His^- 
tagged PACSIN 1 and FLAG-tagged mSosl (A) or transfected with 
PACSIN 1 alone (B), were serum-starved and left untreated (-EGF) or 
were treated with 100 ng/ml EOF for 2 min (+EGF). PACSIN 1 was 
detected with an anti-PACSIN 1 rabbit antibody, and mSosl was de- 
tected with anti-FLAG monoclonal antibody. F-actin was detected using 
phalloidin-tetramethylrhodamine isothiocyanate (phalloidin). Magnifi- 
cation, X 630. 
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Fig. 10. Regulation of the PACSIN 1-mSosl interaction by 
phosphorylation. Serum-starved COS-7 cells were treated with 100 
ng/ml of EGF for 5 min at 37 'C (EGF +). When indicated, prior to the 
EGF treatment, cells were preincubated with PD-098059 (PD +). Ly- 
sates from cells were incubated with GST-PACSIN 1 SH3 coupled to 
glutathione-Sepharose beads, and material specifically bound to the 
beads was processed for SDS-PAGE, along with an aliquot of cell lysate 
(starting material (SAO). The migratory position of mSosl and the 
upwardly shifted phosphorylated mSosl (phospho-mSosI) and that of 
GST-PACSIN 1 SH3 domain are indicated on the right. 

toward Rac, causing Rac activation (17, 18). Indeed, mSos has 
been demonstrated to function directly in growth factor-in- 
duced membrane ruffle formation (18, 21). Our observation 
that full-length mSosl targets to sites of Rac activation in 
response to EGF stimulation is consistent with its role in 
Rac-dependent actin reorganization. The co-localization of 
PACSIN 1 with mSosl suggests that through direct protein 
interactions, PACSIN may regulate mSos function in ruffle 



26628 



Sos Binds PACSINISyndapin 



formation. In fact, PACSIN 1 interacts with mSosl within the 
same region that mediates mSosl interactions with Eps8 and 
E3bl (18). The binding of these proteins to mSos stimulates 
mSos GEF activity toward Rac (18). Thus, PACSIN interac- 
tions with mSos may play a comparable modulatory role. 

We next sought to determine whether the interaction be- 
tween PACSIN 1 and mSosl was subject to regulation in re- 
sponse to growth factor stimulation. Several laboratories have 
demonstrated that MEK-dependent feedback phosphorylation 
of mSos leads to Ras desensitization by inducing the dissocia- 
tion of mSos from Grb2 and She (10, 28, 46-49). We thus 
examined mSosl-PACSIN 1 interactions following MEK-in- 
duced mSosl phosphorylation. Treatment of COS-7 cells with 
EOF led to a small but highly reproducible upward shift in 
mSosl mobility (Fig. 10). This shift is characteristic of MEK- 
induced mSosl phosphorylation (28, 49) and was blocked by 
preincubation with the MEK inhibitor PD-098059 (Fig. 10). 
Interestingly, phosphorylated mSosl showed less binding to 
the SH3 domain of PACSIN 1 than did nonphosphorylated 
mSosl (Fig. 10; representative of three separate experiments). 
In contrast, the binding of N-WASP to the PACSIN 1 SH3 
domain was identical under the various treatment conditions 
(data not shown). Thus, phosphorylation of mSosl negatively 
regiilates its interaction with PACSIN 1. Interactions between 
SH3 domain-containing proteins and their proline-rich binding 
partners are often regulated by phosphorylation. For example, 
phosphorylation of dynamin 1 and synaptojanin 1 reduces their 
interactions with the SH3 domains of the amphiphysins, allow- 
ing for regulation in their targeting to clathrin-coated pits (50). 

The activation of Ras and the formation of membrane ruffles 
occurs rapidly following EOF treatment of COS-7 cells (both 
can be detected within 30 s).^ In contrast, MEK-dependent 
feedback phosphorylation of mSos and the dissociation of the 
mSos-Grb2 complex is detectable only after several minutes 
(48). Thus, in parallel with the mSos-Grb2 interaction it is 
likely that mSos and PACSIN are initiaUy in association fol- 
lowing their EGF induced translocation to membrane ruffles. 
Feedback phosphorylation via the Ras/MEK-dependent path- 
way, which appears to be a general mechanism to attenuate 
Ras signaling, would subsequently terminate the PACSIN- 
mSos interaction. 

It is well estabhshed that Rac functions downstream of Ras 
in Ras-mediated signaling to alterations in the actin cytoskel- 
eton (51). The ability of mSos to function as a GEF for Rac is 
central to the transduction of signals from Ras to Rac (16) and 
suggests the need for adaptor proteins involved in targeting 
mSos to sites of actin dynamics. Interestingly, Eps8, which 
activates the mSos GEF activity toward Rac following its indi- 
rect binding to mSos, also binds to actin (16). Thus, it is tempt- 
ing to speculate that the PACSINs, which localize to sites of 
actin turnover and regulate actin cytoskeletal dynamics, may 
play a role in targeting mSos to sites of actin dynamics. Alter- 
natively, interactions between PACSIN and mSos may aUow 
for coordinated activities of the two proteins in regulation of the 
actin cytoskeleton. Future experiments wiU be aimed at testing 
these various hypotheses. 
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Abstract Twelve diseases, most with neuropsychiatric 
features, arise from trinucleotide repeat expansion muta- 
tions. Expansion mutations may also cause a number of 
other disorders, including several additional forms of 
spinocerebellar ataxia, bipolar affective disorder, schizo- 
phrenia, and autism. To obtain candiate genes for these 
disorders, cDNA libraries from adult and fetal human 
brain were screened at high stringency for clones con- 
tainmg CAG repeats. Nineteen cDNAs were isolated and 
mapped to chromosomes 1, 2, 4, 6, 7, 8, 9, 12, 16, 19, 20, 
and X. The clones contain between 4 and 17 consecutive 
CAG, CTG, TCG, or GCA triplets. Clone H44 encodes 
40 consecutive glutamines, more than any other entry in 
the nonredundant GenBank protein database and well 
within the range that causes neuronal degeneration in 
several of the glutamine expansion diseases. Eight cDNAs 
encode 15 or more consecutive glutamine residues, sug- 
gesting that the gene products may function as transcrip- 
tion factors, with a potential role in the regulation of neu- 
rodevelopment or neuroplasticity. In particular, the con- 
ceptual translation of clone CTG3a contains 18 consecu- 
tive glutamines and is 45% identical to the C-terminal 306 
residues of the mouse numb gene product. These genes 
are therefore candidates for diseases featuring anticipa- 
tion, neurodegeneration, or abnormalities of neurodevelop- 
ment. 
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Introduction 

Trinucleotide repeat expansion mutation is now know to 
cause 12 diseases, most with neuropsychiatric features 
(Linblad and Schalling 1996; Paulson and Fischbeck 
1996; Ross 1995; Zoghbi 1996). Seven of these are 
known as the type 1 disorders - spinocerebellar ataxia 
type 1 (SCAl, Orr et al. 1993), SCA2 (Imbert et al. 1996; 
Pulst et al. 1996; Sanpei et al. 1996), Machado- Joseph 
disease (MJD or SCA3, Kawaguchi et al. 1994), SCA6 
(Zhuchenko et al. 1997), dentatorubral pallidoluysian at- 
rophy (DRPLA, Koide et al. 1994; Nagafuchi et al. 1994), 
Huntington's disease (HD, Huntington's Disease Collabo- 
rative Research Group 1993), and spinal and bulbar mus- 
cular atrophy (SBMA, La Spada et al. 1991). Each is 
caused by a (CAG)ii expansion in an open reading frame, 
resulting in an expanded glutamine repeat. The properties 
of the repeats in the other (type 2) expansion mutation dis- 
eases vary widely. Myotonic dystrophy is caused by a 3' 
untranslated (CTG)n expansion (Brook et al. 1992; Fu et 
al. 1992; Mahadevan et al. 1992), the A and E forms of 
fragile X syndrome (Fu et al. 1991; Knight et al. 1993; 
Kremer et al. 1991; Verkerk et al. 1991) and some cases of 
Jacobsen's syndrome (Jones et al. 1995) result from 5' un- 
translated region (CCG)n expansions, and Friedreich's 
ataxia is caused by an intronic (GAA)n expansion (Cam- 
puzano et al. 1996). Expandable trinucleotide repeats 
therefore are found in translated, transcribed but imtrans- 
lated, and intronic regions; they may be G-C or A-T rich 
and range from minimal to highly variable in length in the 
normal population. 

At least four lines of evidence indicate that additional 
disorders may arise from trinucleotide repeat expansion 
mutations. First, an antibody (IC2) that specifically recog- 
nizes expanded glutamine repeats detects an expansion 
segregating with SCAT (Trottier et al. 1995). Second, in- 
direct evidence of CAG expansion has been detected us- 
ing rapid expansion detection (RED, Schalling et al. 
1993) in a pedigree with SCA7, and less clearly in hetero- 
geneous populations of patients with bipolar affective 
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disorder and schizophrenia (Linblad et al. 1996; Linblad 
and Schalling 1996; O'Donovan et al. 1995). Third, sev- 
eral neurodegenerative disorders, including SCA4, 
SCA5, SCA7, and familial Parkinson disease, are pheno- 
typically similar to the type I expansion mutation disor- 
ders. Fovirth, anticipation, the phenomenon of increasing 
phenotypic severity or decreasing age of onset in succes- 
sive generations affected by a disease (Mclnnis 1996; 
Ross et al. 1993), is found in most of the expansion mu- 
tation diseases. Anticipation has been detected in a dis- 
parate group of other diseases, including affective disor- 
der (Engstrom et al. 1995; Mclnnis et al. 1993; Nylander 
et al. 1994), schizophrenia (Chotai et al. 1995; Gorwood 
et al. 1996; Stober et al. 1995; Thibaut et al. 1995), 
autism (Stine 1993), familial Parkinsonism (Bonifati et 
al. 1995; Markopoulou et al. 1995; Payami et al. 1995; 
Plante-Bordeneuve et al. 1995), familial leukemias (Hor- 
witz et al. 1996), Crohn's disease (Polito et al. 1996), 
Meniere's disease (Morrison 1995), torsion dystonia 
(LaBuda et al. 1993), rheumatoid arthritis (McDermott et 
al. 1996), facioscapulohumeral muscular dystrophy 
(Tawil et al. 1996), Holt-Oram syndrome (Newbury- 
Ecob et al. 1996), and familial spastic paraplegia 
(Raskindetal. 1997). 

We have sought to identify candidate genes for these 
disorders by screening cDNA libraries for the presence of 
DNA fragments containing CAG, CCG, CCA, and AAT 
trinucleotide repeats (Li et al. 1993; Margolis et al. 1995 
a, b). Our description of CTG-B37, a cDNA fragment 
with a highly polymorphic CAG repeat located within an 
open reading frame on chromosome 12, directly led to 
the finding that an expansion mutation within the CTG- 
B37 repeat causes DPIPLA (Koide et al. 1994; Nagaftichi 
et al. 1994). This same strategy of screening cDNA li- 
braries for trinucleotide repeats was later employed to 
identify the MJD gene (Kawaguchi et al. 1994) and the 
SCA6 gene (Zhuchenko et al. 1997). Screening genomic 
contigs for trinucleotide repeats was used to clone the 
gene for SCA2 (Pulst et al. 1996). Based on the repeats 
that expand to cause disease, repeats with the highest 
likelihood of xmdergoing expansion mutation consist of at 
least six consecutive CAG or CTG triplets in the tran- 
scribed portions of genes expressed in brain. To identify 
genes with these features, we have screened human adult 
frontal cortex and fetal brain cDNA libraries at high strin- 
gency for the presence of CAG or CTG repeats. We now 
report the identification and mapping of 19 of these 
cDNA fragments. 



Materials and methods 

cDNA cloning 

Adult human frontal cortex (clones with CTG prefixes) and human 
fetal brain (all others) cDNA libraries (Strategene) was screened 
with either a (CTG)2o or a (CTG) 15 radiolabeled oligonucleotide 
using standard techniques as previously described (Margolis et al. 
1995 a). In brief, approximately 200000-500000 plaques were 
screened in each of four sets of primary screens. Hybridization was 



at 52° (CTG and F clones) or 60** (H and L clones) with 50% for- 
mamide with a 70** (CTG and F clones) or 72** (H and L clones) 
wash at 0.2 x SSC/0.5% SDS for 15 min or until activity was be- 
low about 5 counts/s. After plaque purification, plasmids were ex- 
cised using an in vivo excision procedure and purified. 

Inserts were released by EcoKl digest, and Southern blots were 
prepared after electrophoresis through 1% agarose. Southern blots 
were hybridized as described for library screening, and positive in- 
serts were sequenced using ABI automated technology until the re- 
peat was detected or until a blastn search of the GenBank nonre- 
dundant, dbest, or dbsts libraries indicated that the clone had been 
previously described. Clones H32, 44, and 45 represent consensus 
double-stranded sequences of multiple clones. The contigs were 
formed in part by odier clones identified in the library screen de- 
scribed above and in part by rescreening the fetal brain library with 
45mer oligonucleotides antisense to sequence immediately adja- 
cent to the repeat of each clone. H45: 5' CCTGGGCACTCATTG- 
GAGTCATGGTACTTATCATGGGTGTCTGCT 3'; H44: 5' AG- 
GTGGAATGGAGATGAGTCCCTGACGCTGAAGGCTGAG- 
CAGATG 3' and GGCCGCCCGTTCACCTTGATGAGACCA- 
TTGCCGCGTTTGCAGTGC; H32: 5' AGGTGTGATGGTTTT- 
TCCAGGGAGCTGAACTCCTGTGGCTTGGGA 3'. 



Mapping 

Clones were assigned to a chromosome or a specific locus using a 
variety of methods as indicated in Table 1. The NIGMS panel 2 
monochromosomal human-rodent hybrid cell lines were screened 
using PCR primers that spanned the repeat as indicated in Table 2 
(Dubois and Naylor 1993). Radiation hybrid mapping was per- 
formed by Research Genetics on either the GeneBridge 4 or Stan- 
ford G3 Radiation Hybrid panels using primers listed in Table 2. 
For L69 and F28, primers amplifying a region flanking the repeat 
were used to avoid crosshybridization with hamster genomic DNA 
(L69: 5' CTCCGAGTATCGTGACAGGTG 3', 5' AAGCAGAG- 
GTGAGGGAGAGTC 3', 59** annealing temperature, 157-bp 
product; CAGF28: 5' GCTGAAGGTGTAAAACCG 3', 5' GA- 
AAGTGACTCCAGAGAC 3', 52° annealing temperature, 86-bp 
product). CTG3a was mapped by analysis of repeat polymorphism 
in CEPH pedigrees 1332, 1347, 1362, and 884 (Weissenbach et al. 
1992) with the LINKAGE program (Ott 1991). As indicated, other 
clones were assigned to a locus either through a GenBank match 
with a previously mapped ESTs (expressed sequence tags) or 
through use of Gene Map (Schuler et al. 1996). 



Polymoqjhism analysis 

Variability of repeat length was determined by size separation of 
radiolabeled PCR products on a 6% denaturing polyacrylamide 
gel, as previously described (Margolis et al. 1995 a). Template 
consisted of 40-100 ng human genomic DNA from either a set of 
unrelated CEPH parents or a panel of unrelated individuals with 
movement disorders of unknown etioloy, and reaction conditions 
were as described in Table 2. Allele lengths of additional individ- 
uals with movement disorders were tested for size variation in L69 
using automated detection (ABI) of fluorescence-labeled PCR 
product. 



Results 

Table 1 summarizes the initial characterization of the 
cDNA clones identified by screening adult human frontal 
cortex or human fetal brain cDNA libraries with a 
(CAG)i5 or (CAG)2o probe. Those clones for which data 
on both polymorphism and chromosomal location are 
available are included, and clones corresponding to 
known genes were excluded except where noted, leaving 



116 

Table 1 Repeat sequence, mapping, and homologies 



Clone 


oenoanK 


Longest 


Conceptual translation 


Nlapping 
method* 


Chromosomal 


Homology 




accession 


repeat 


of repeat region 


location 




CTG3a 


U80758 


6 


Ql8 


E 


19ql3.13-19ql3.2 


mouse numb (U70764) 










D19S224-D19S228 




CTG4a 


U80744 


8 


L4PL9 


A 


1 




CTG7a 


U80745 


8 


QjLQjHLIKLHHQNQsLQRIAQLQLQis 


A 


20 




CTG20a 


U80749 


13 


PQzjPjCorAijTA,,) 


A 


6 




F28 


U80735 


7 


Qg, 54% Q in 68 residue region 


B 


7q36.3 

qter ofD7S2781 


mcag32 (U23862) 


HI 


U80738 


13 


('AO'IiiASOASOk orLi< 


D 


12pl2 

D12S328-D12S89 




H3 


U80747 


5'= 8 
3'= 8 


5': Qu 
3': Q18 


C 


4 


2.119(5' repeat only) 


H16 


U80737 


12 


Q28 


D 


20ql3.13 

D20S891-D20S109 




H26 


U80750 


4 


Q2PQ8PQ4PQ3PQPQPQ4PQ3PQ 


A 


16 


mcagl9 (U23860) 


H32 


U80743 


15 


Q30 


B 


12q24.33 












qter ofD12S367 




H38 


L13744 


10 




C 


9q22 


AF.9 (LI 3744) 


H39 


U80740 


17 


none 


B 


Xpll.4 














AFMA212ZG1-DXS6745 


H44 


U80741 


7 


Q40HPGKQAKEQ10 


B 


6ql4-15 
D6S1291 




H45 


U80742 


7 


Q25YHIQ6lLRQ25HQ6GP(XQ)5 


C 


1 


KIAA0192 (D83783) 


L69 


U80752 


13 


none 


B 


2q37.3 
D2S2704 




L85 


U80755 


10 


Sioor Ai5 orQii 


A 


8 




L114 


U80765 


6 


Qu 


B 


12ql3.12 

WI-7107-DS 12s 1363 




L234 


U80751 


9 


none 


B 


2q21.2 

AFMA153WBI 




L237 


U80757 


15 


none 


B 


16q24.3 

D16S520-WI-12410 





* Mapping methods: A, NIGMS panel 2; B, radiation hybrid panel; C, identity to mapped cDNA; D, Gene Map; E, linkage 



a total of 20 repeat regions from 19 indepdendent cDNA 
clones. The number of consecutive triplets ranges from 4 
to 17 (mean = 9.8). Of the 20 repeats, 16 appear to be lo- 
cated in extended open reading frames. Of these 16, con- 
ceptual translation indicates that 12 (and possibly 14) en- 
code polyglutamine. In each case the CAG repeat is inter- 
rupted frequently by CAA triplets also encoding gluta- 
mine. Eight of these polyglutamine regions consist of 15 
or more consecutive glutamines, including Q28 (HI 6), Q30 
(H32), and two adjacent stretches of Q25 (H45), The 
longest consecutive glutamine repeat, Q40, is present in 
H44 (Fig.l). 

The cDNAs correspond to loci on 12 different chromo- 
somes, as determined by the methods indicated in Table 1 : 
one each from chromosomes 4, 7, 8, 9, .19 and X; two 
each from chromosomes 1, 2, 6, 16, and 20; and three 
from chromosomes 12. Blastn and blastp homology 
searches (Altschul et al. 1990) of GenBank revealed that 
the repeats in F28 and H26 are present in short cDNA 
fragments termed mcag32 and mcagl9 (Yamagata et al. 
1996). H45 is identical to the 3' portion of a longer cDNA 
fragment (D83783) isolated from the KG-1 human cell 
line. D83783 is ubiquitously expressed, and contains both 



a DNA topoisomerase II motif and an N terminal trans- 
membrane domain. A short murine amino acid sequence 
termed the mopa box protein (A26892) is also similar to 
H45. The first repeat in H3 is contained in clone 2.119 
(Neri et al. 1996). H38 is a portion of the human gene AF- 
9, identified in relationship to a t(9:ll) translocation in 
which AF-9 fuses with ALL-1 to induce acute lympho- 
cytic leukemia (Prasad et al. 1993). AF-9 may function in 
the transcriptional complex (Caims et al. 1996; Welch and 
Drubin 1994). CTG3a is similar to the mouse numb gene 
(Zhong et al. 1996), with 44.6% amino acid identity and 
85.3% amino acid conservation (ALIGN, Pearson and 
Lipman 1988) over the length of CTG3a, which extends 
from m-numb amino acid 267 to 585 (Fig. 2). A compari- 
son of CTG3a with both Drosophila and mouse numb by 
Block Maker (Henikoflf et al. 1995) suggested five possi- 
ble blocks, corresponding to CTG3a amino acid residues 
26-52, 57-68, 85-104, 123-133, and 256-264. The 
Gibbs alignment (Lawrence et al. 1993) suggested three 
blocks, corresponding to CTG3a amino acid residues 
29-60, 188-226, and 256-278. A search of these align- 
ments against the Blocks database using LAMA 
(Pietrokovsi 1996) revealed a match only to one block 
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Table 2 Polymorphism analysis 



Clone Longest PGR primers 
nucleotide 
repeat 



Anneal Product 
temper- length 
ature (in bp) 



No. of 
chromo- 
somes tested 



Allele bp: 
frequency 



Hetero- 
zygosity 



CTG3a 6 

CTG4a 8 

CTG7a 8 

CTG20a 13 

F28 7 

HI 13 

H3 5'- 8 

3'= 8 

H16 12 



H26 
H32 

H38 
H39 
H44 



4 
15 

10 
17 



H45 



L69 13 



L85 10 
L114 6 



GGGCACTGGGGCCACTGAGG 
CCTGGGCACAAGCGGACACC 

CGTCCCGCTGTCTTCTGCTTC 
AGGCGAACCCAGTCGTTCTCC 

TGCAGCTCCAACAACAGCAAC 
CTGCTGCATCGGTGGCTGCTG 

CACCATGTCGCTGAAGCCCC 
CGCCGGGCTTGCGGACATTG 

GAAAGTGACTCCAGAGAC 
GTAAGGTTGCTGAGAGAT 

GGAGACTGGAAGTGTGGT 
CTCCCAGGCATCACAGCA 

GCACAGCAGCAACAAAGG 
GTCCTAAGGGAGACCAAT 



ACAAAAACCCCTTGATGC 
CCTGGAGCTGTGGAGGTG 

GGGTGGCTATGATGATGC 
TGAAGACCTGGGGTTGCT 

ATCCTAATAACGGCACTTCC 
TTCCTGGTGGCGAAATCGTG 

ACATTTCCAGCTTCTCAGGCA 
TCTGTTGCTGCTGCTGCTGTT 



TTAATTTGTGAGGCTTTGAAAAAC 
CTAATAGGAGTATTCATACCAGCA 

CTCTGATTGCTTAGTGGACAA 
TTCAGTACATTGCTGCTGCTG 

CAGCAAGAGCAGTTACAT 
GCTTGCTTTCCAGGATGC 

TGCTGCTGGAAGACAAGC 
GAAAGCAAGCGAAAGAGC 



TGCTGCTGCCGGATGTGGTA 
GCAGGCGTCCTGTCAACA 

AGGTGAGGGAGAGTCCAT 
TATGGGCTCCTGTTCCTG 



GCTCTCGGGCATTGATGTTA 
AGCAGCAGCGGTAGCAGTGG 

GGGACCCAGAACTTGGTTTG 
CGAAGCCTCGGACCTGATTC 



64° 



62* 



62° 



530 
54° 

540 
570 

540 
62° 



58° 



57° 



147 



112 



101 



61° 125 

(7-deaza- 

dGTP) 

49° 145 

47° 82 

(DMSG) 

47° 118 

(DMSG) 



102 
120 

224 
113 

174 
101 



52° 167 

(bp 424- 

590) 
52° . 79 
(DMSG) (bp 581- 

659) 

59° 131 

(bp 2388- 
2518) 

55° 104 



32 

28 

40 

38 

32 
30 
32 

32 
30 

32 
32 

32 

26 

32 
32 

28 
88 



136 



195 



26 



28 



141:0.03 
147:0.66 
150:0.31 

100:0.25 
109:0.68 
112:0.07 

86:0.02 
101:0.80 
104:0.18 

125:0.97 
128:0.03 

145:1.00 

82:0,83 
85:0.17 

118:0.84 
124:0.03 
127:0.06 
130:0.03 
133:0.03 

102:1.00 

120:0.53 
123:0.43 
129:0.03 

215:0,19 
224:0.81 

110:0.53 
113:0.41 
119:0.03 
128:0.03 

174:0.88 
177:0.06 
180:0.06 

89:0,23 
98:0,69 
101:0.08 

167:1.00 
79:1.00 



131:1.00 



83:0.02 
92:0.09 
104:0.20 
107:0.02 
110:0.45 
113:0.01 
116:0.03 
122:0.15 
126:0.01 

133:0.78 
136:0.15 
139:0.08 
195:1,00 



62% 

21% 

20% 

15% 

0% 
20% 
31% 

0% 
60% 

25% 
38% 

25% 

31% 

0% 
0% 

0% 
75% 



46% 



0% 



Table 2 (continued) 



Clone 


Longest 


PGR primers 


Anneal 


Product 


No. of 


Allele bp: 


Hetero- 




nucleotide 




temper- 


length 


chromo- 


frequency 


zygosity 




repeat 




ature 


(in bp) 


somes tested 






L234 


9 


ACAAGAGGAAAAGGGACATAGC 
GCAAAACCAAATACCAGGTCTC 


5r 


94 


32 


124:0.88 
127:0.09 
130:0.03 


14% 


L237 


15 


GGTTCCCTGCACAGAAACCATC 


590 


120 


26 


96:0.04 

QQ.fl (\A 

108:0.04 
117:0.58 
120:0.08 
123:0.23 


33% 



Fig. 1 Sequence and concep- 
tual translation of the gluta- 
mine-rich region of H44. Forty 
consecutive glutamines are en- 
coded by a CAG repeat inter- 
rupted in seven places by 
(CAA)m 



Fig. 2 Homology between the 
mouse numb gene product (m- 
numb) and the conceptual 
translation of CTG3a, There is 
44.6% identity and 85.3% con- 
servation between the se- 
quences 



424 CAS CAA GAG QhQ TTA CAT CTT CAG CTT TTG CAG CAG CAG CAG CAA CAS 471 
QQEQLHLQLL QQQQQQ 

472 CAG CAG CAG CAA CAA CAG CAG CAA CAA CAG CAG CAG CAA CAA CAA CAA 519 
QQQQQQQQQQQQQQQQ 

520 CAA CAG CAG CAA CAA CAG CAG CAG CAG CAG CAA CAG CAG CAG CAG CAG 567 
QQQQQQQQQQQQQQQQ 

568 CAA CAS CAT CCT GGA AAG CAA GCG AAA GAG CAG CAG CAG CAG CAG CAG 615 
QQHPGKQAKEQQQQQQ 

616 CAS CAA CAG CAA TTG GCA GCC CAG CAG CTT GTC TTC CAG CAG CAG CTT 663 
QQQQLAAQ.QLVFQQQL 

664 CTC CAS ATG CAA CAA CTC CAG CAG CAG CAG 691 
LQMQQLQQQQ 

m-numb 278 PFKRQLSLRINELPSTMQRKTDFPIKNTVPEVE GEAESISSLCSQITSAFSTPSEDPFSSA 338 

CTG3a 1 PFKRQLSLRLI^PSTWRRTDFQVKGT\^EMEPPGAGDSDSINAUrr^ISSSFASAGA-PAPG 68 

m-numb 339 PftlTKPVTLVAPQSPVLQGTEWGQSSGAASPGLFQAGHRRTPSEADRWLEEVSKSVRAQQPQVSAA 403 

CTG3a 69 PPATTGTSA WGEPSVPPAAA-FQPGHKRTPSEAERWLEEVSQVAKAQQQQQQQQ 120 

m-numb 404 PLQPVLQPPPPAAIAPPAPPFQGHAFLTSQFVPVGWPPLQPAFVPT-— QSYPVAN— GMPYPA 467 

CTG3a 121 QQQQQQQQQQAASVAPVPTMPPAI^PFPAPVGPFDAAPAQVAVFLPPPHMQS^ 185 



m-numb 468 -SNVPWGITPSQMVANVFGTAGHPQTTHPHQSPSU^QQTFPQYETSSATTSPFFKPPAQHLNG 527 

CTG3a 186 MPRVPWGITPSQMVANAFCSAAQLQ— PQPATLLGKAGAFPPPAIPSAPGSQ ARPRPNG 247 

m-numb 528 SAAFNGVDNGGLASGNRHAEVPPGTCPV-DPFEAQWAALESKSKQRTNPSPTNPFSS 583 

CTG3a 248 AP WPPEPAPAPAPELDPFEAQWAALEGKA TVEKPSNPFCG 280 



formed from the synapsin family. This aligmnent is based 
on the abundance of glutamine and proline residues in the 
query block, and is of questionable biological signifi- 
cance. CTG3a and m-numb share a common PKC phos- 
phorylation site (SLR at CTG3a amino acid residues 7-9) 
and three potential CamKinase II phosphorylation sites 
(CTG3a residues 28-31, 95-98, and 97-100). No other 
motif was detected and no other cDNA had significant ho- 
mologies to other known genes at the DNA or protein 
level. 

Table 2 summarizes the polymorphism analysis of this 
set of loci, as determined by radiolabeled PGR of CEPH 
individuals or individuals with movement disorders of xm- 
known etiology. Heterozygosity ranged from 0% to 75%, 
with a mean of 25%. The 75% heterozygosity of L69 cor- 
responded to the presence of 9 alleles, with the repeat pre- 
sumably ranging in length from 6 to 20 triplets. Three 
other loci, CTG3a, HI 6, and L85 had fewer alleles but 



heterozygosity over 45%. Testing more individuals would 
probably have resulted in the detection of additional alle- 
les. Several of the dories encoding long stretches of 
polyglutamine are not polymorphic; in each case the glut- 
amine repeat is encoded by a combination of CAG and 
CAA codons. 

To test the correlation of repeat length with the extent 
of heterozygosity, the modal repeat length was determined 
for each clone, and also for clones containing CAG re- 
peats previously identified in oiu: laboratory and for genes 
with CAG or CTG repeats in which expansion mutation is 
known to occur (Fig. 3). The repeat in ataxin-2, which ex- 
pands to cause SCA2, is not highly polymorphic, and its 
ratio of heterozygosity to repeat length is similar to a 
number of the clones listed in Tables 1 and 2. All other 
CAG/CTG repeats associated with a disease are highly 
polymorphic with fairly long repeat lengths; the heterozy- 
gosity of L69, MAB21L1 (a gene previously identified in 
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modal repeat length 
(consecutive triplets) 

Fig. 3 The relationship between heterozygosity and modal repeat 
length. The modal repeat length and heterozygosity at the loci for 
CAG-containing clones identified in our laboratory (unfilled circles) 
were combined with similar infonnation for the loci associated with 
CAG or CTG expansion mutations (filled circles). There is a mar- 
ginal positive correlation between the two variables (r^ = 0.28). Note 
that the repeat causing SCA2 is similar in length (8 triplets) and 
heterozygosity (15%) to many of the repeats listed in Tables 1 and 2 



oiir laboratory and originally termed CAGRl, Margolis et 
al. 1996), and CTG-B33 (Li et al. 1993) are in a similar 
range. 



Discussion 

We now report the identification of 19 cDNA clones con- 
taining 20 CAG repeats. The repeats contain a mean of 10 
consecutive triplets, with a maximum of 17. Most of the 
repeats are in open reading frames, contain the triplet 
CAA interspersed with longer runs of consecutive CAG 
triplets, and encode stretches of polyglutamine. The clones 
correspond to loci mapped to 12 different chromosomes 
and have a heterozygosity of up to 75%. 

Four of the clones encode tracts of 25 to 40 consecu- 
tive glutamine residues. These are among the most gluta- 
mine-rich regions yet identified in the proteins of any 
species; in comparison, the ten most significant matches 
(blastp) in GenBank to a 40 glutamine query sequence 
consisted of proteins or protein fragments with 18 to 38 
consecutive glutamines, only two of which were human. 
The longest stretches of consecutive glutamines were 
found in the himian transcription factor TFIID (P20226, 
Q37 or Q38, Imbert et al. 1994) and the yeast transcription 
factor SNF5 (PI 8480, Q37), neither of which is homolo- 
gous to any of the cDNA clones described here. The next 
longest glutamine repeats were present in the mouse mopa 
protein fragment (A26892, CL3YHIRQ7MLRQ27PHQ5) 
homologous to H45 and the uncharacterized yeast protein 



YM8520.13C (S54522, Q30HQ5). The remaining six pro- 
teins, with substantially fewer consecutive glutamines, 
were the Drosophila developmental regulatory protein 
notch (S09358, Q13HQ21), the fungal transcription factor 
white collar-1 (1480115, HQHQ28HQHQ4), a murine AT- 
motif binding factor (1585921, Q19), a long human protein 
fragment nearly identical to H45 (D83783), the Droso- 
phila ecdysone-induced protein E74A (B53225, Qig), and 
a putative African malarial mosquito nucleic acid binding 
protein (S21170, Q20). This list indicates that few gluta- 
mine repeats longer titian 25 residues have yet been identi- 
fied, and that proteins with long glutamine repeats, such 
as the gene products of HI 6, H32, H44, and H45, may 
function as transcription factors. 

Clone H44, located on 6ql4-15, is of particular inter- 
est since it encodes 40 consecutive glutamine residues and 
is present in the same length in all tested individuals. Re- 
peat expansions resulting in as few as 21 consecutive glu- 
tamines in the ttiA-voltage-dependent calcium channel 
causes SCA6, and 35 consectuvie glutamines in ataxin-2 
causes SCA2 (Imbert et al. 1996; Pulst et al. 1996; Sanpei 
et al. 1996). The minimal repeat length for disease onset is 

35 or 36 glutamines in HD (Barron et al. 1993) and 40 
glutamines in SBMA (La Spada et al. 1991) and SCAl 
(Ranum et al. 1995). Normal alleles with more than 35 or 

36 residues have not been detected in DRPLA or MJD, 
though the minimum expansion associated with disease is 
49 glutamines for DRPLA and 68 glutamines for MJD 
(Kawaguchi et al. 1994; Koide et al. 1994; Nagafuchi et 
al. 1994). These fmdings, coupled with our data that a pu- 
tative protein exists with 40 consecutive glutamines and 
the presence of at least 38 consecutive glutamines in 
TFIID, demonstrate that the development of an expansion 
mutation disease is not solely determined by the length of 
the glutamine repeat within a given gene. Other features 
of protein structure or expression must be relevant. The 
available data suggests, however, the presence of a thresh- 
old effect: it may be that a glutamine repeat of 60 or more 
residues in any protein leads to cell damage, whereas 
shorter stretches of glutamines are only toxic in a protein 
v^th certain as yet unidentified structural features or pe- 
culiarities of spatial or temporal expression. Conversely, 
shorter polyglutamine repeats may be inherently neuro- 
toxic, with unknown aspects of structure, function, or ex- 
pression serving a protective role in many, but not all, pro- 
teins. Comparative anylsis of TFIID and H44 with the 
ttiA-voltage-dependent calcium channel, ataxin-2, hunt- 
ingtin, and the androgen receptor in cell toxicity assays 
(Ikeda et al. 1996) may help elucidate these features. 

The CAG repeat-containing clones listed in Tables 1 
and 2 can be considered candidate genes for neuropsychi- 
atric disorders on the basis of both structure and function. 
From a structural standpoint, several of the repeats have 
either a high heterozygosity index (e.g., L69) or a fairly 
large number of alleles (H3 and L237), features of most of 
the repeats known to imdergo expansion mutation. As de- 
picted in Fig. 3, the ratio of modal repeat length to het- 
erozygosity in many of the other cDNA clones is quite 
similar to ataxin-2; like ataxin-2, many of these cDNAs 
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have moderate stretches of CAG repeats interspersed with 
occasional .CAA triplets. The presence of uninterrupted 
CAG repeats in expanded ataxin-1 and ataxin-2 demon- 
strates that CAA can mutate into CAG, leading to repeat 
length instability (Imbert et al. 1996; Orr et al. 1993; Pulst 
et al. 1996; Sanpei et al. 1996). A review of OMIM indi- 
cates that a number of diseases, though none with estab- 
lished anticipation, map near the loci listed in Table 1. Of 
particular interest, two developmental disorders, tripha- 
langeal thumb-polysyndactyly syndrome (OMIM 190605) 
and sacral agenesis (OMIM 176450) map to 7q36 (clone 
F28), noninsulin-dependent diabetes mellitus 1 (OMIM 
601283) and brachydactyly type E map to 2q37 (clone 
L69) progressive foveal dystrophy (OMIM 136550) maps 
to 6ql4-16.2 (clone H44), and type 1 congenital nephro- 
sis, Finnish type (OMIM 256300), maps to the same 
marker on 19ql3 as CTG3a. 

From the standpoint of function, we and others have 
postulated that developmental abnormalities may imderlie 
at least some forms of schizophrenia, autism, and bipolar 
affective disorder (Keshavan et al. 1994; Margolis et al. 
1994; Peterson 1995; Ross and Pearlson 1996; Wein- 
berger 1995). Both Drosophila and murine numb appear 
to specify cell fate during early stages of neurodevelop- 
ment (Spana and Roe . 1996; Zhong et al. 1996). The 
CTG3a gene product may share this capacity, though the 
extent of sequence identity (45%) between CTG3a and 
mouse numb indicates that the CTG3a gene product is not 
the human numb homolog but is instead a numb family 
member. Many of the other putative genes identified here 
may also have a specific role in the process of neuronal 
cell-fate determination, migration, or differentiation through 
regulation of transcription. A mutation, expansion or oth- 
erwise, would serve to alter this function. In particular, 
glutamine repeat length has been shown to modulate the 
efFicieny of transcription (Chamberlain et al. 1994; Ger- 
ber et al. 1994). It is tempting to speculate that, though 
disease is induced by marked glutamine expansions, nor- 
mal variations in glutamine repeat length may imderlie 
normal variations in phenotype, modify the consequences 
of mutations in other genes, or influence the response to a 
nongenetic insult. 
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181 tgtccctggc gtccctgctc ttcttcaccg tgctgctcgc tgaccatctg tggctgtgcg 
241 cgggggcccg gccccgggcc agggagctga gcagcgccat gcggccccca tggggggccg 
301 gccgggagcg gcagccggtg cctcctcgcg cggtgctgcc cgtgccgccg ccgccgcccg 
361 gcgagcccag cgcgccccca ggcacctgcg gccccagata cagcaacctg accaaagccg 
421 cccccgccgc cggctctcgg ccggtctgcg gcggcgtccc agagcccacg gggctggacg 
481 cagcttgcac caaattgcaa tctttgcaga gacttttcga accgactact ccggcccccc 
541 ctctgcggcc ccctgactcc ccttcccgtg ccccggccga gttcccctcc gccaaaaaaa 
601 acttgctcaa aggccacttt cggaacttca ctctctcctt ttgcgacacc tacacggtct 
661 gggacttgct gctgggcatg gaccgccccg acagcctgga ctgtagcctg gacaccctga 
721 tgggggacct gctggccgtg gtggccagcc cgggctccgg ggcctgggag gcgtgtagca 
781 actgtatcga ggcgtaccag cggctggacc gacacgctca ggaaaaatat gacgagttcg 
841 acctcgtgct gcataaatac ttacaggcgg aagagtactc aatccggtcc tgcacgaaag 
901 gctgtaaggc tgtctacaag gcctggctgt gctcagaata cttcagcgtg acccagcagg 
961 aatgccagcg ctgggtgccc tgcaagcaat actgcctgga ggtgcagacc cggtgcccct 
1021 ttatactccc cgacaatgag gaaatggtgt acggagggct ccctggcttt atctgtacag 
1081 ggttgctgga tacttcacca aagcgtctgg aaaccaagtg ctgtgacgtg cagtgggtct 
1141 cctgtgaggc gaagaagaag aagaagttca aggagtctga ggcccccaaa acccaccagc 
1201 agcaattcca ccactcctat ttccaccact accaccaaca gtaccatcac taccaccccc 
1261 atcatgatcc cccaggccgt gtcagcaaca agcccgccct gctgccggtc tctgggggct 
1321 cccgcctcag ccctagcagg atccggctct gcgtccttgt tctcatgctc ctccataccg 
1381 tggtgtcctt ctccagcaac cagggtggtg ggggattggg gctggagaca ctgcctgccc 
1441 tagaggaggg cctgacacgg gaagagtgac agtagggagg gaggacagac ctccaccaca 
1501 ctgacatcag ctccagctcc cccaggttgg gggggagggg ctcctcccat gggaggtgta 
1561 ggataaggtg ggggcggggg aaatggggga atgacacatc ccccccaacc tacccccacc 
1621 ccaaaagcag ctccaacaga atggccagag ggcctcaggg agctgcaaaa ctcatccagg 
1681 agaaaaaggc aggagcagca agtgacccct cccaagctcc catctatggg gcttagcaaa 
1741 aaaagggagg aagtgggggc tggatatgcc cacccctgcc aaaagccctg accccaggga 
1801 aggaggctgc tcacccagcc ttggccctgc agggaatgtt ggggggcaca gaggggagaa 
1861 gctctttccc ccactccaca ttccttttgt tgccaagatc cttaattccc tcctgcccat 
1921 atccctacag gcgacggcag acagtgcaat ggccctcctg ccacttcagc acaccttgcc 
1981 ccacctggga ccatcacacg tgaagcacca ggctgggaga tgaggtgcac acagttgcag 
204 1 ctaggtcggg gccccagtta agctgtgccc caccacccta ggcaatgagg ggcaggaaag 
2101 gggtacagaa tgaatggtga aagagagtgg agatacggaa gggggagagg agaggagaaa 
2161 tgtggacaga gggcttgaag acatggccga ataggctact gccacccggc tttggggaga 
2221 tgggcgataa gtgttgaggt aggctcgaga actgctcccc aaatcagtga gaattgcatt 
2281 aggagcctct ggggaaagag cacaggaact gagttgctgg gcctcaaaac aaaaggctgc 
2341 agcacatgtt agactgccgc tgtctgaaga ccatatctta cagaaggtgc acatactccc 
2401 aagggccact cttgccctgg agatcttggc cttggggcaa agacatttcc atttccgatc 
24 61 tccatgggga ggagggactt aagcccaagt gggtcaggcc tggcccaggt ccttatactc 
2521 ccctatcttg tcctagccca ggcctccatg aaggagaacc tggtggcatt gcccagagct 
2581 ctccagagat gagcctcccc tatccccacc ctgtccccgt ccccagccta ccaaaggtat 
2641 tcacaccttt ccatccctga ctctatgggt tactgtccca attgcgaaac acccatactc 
2701 ctcttcaccc tgtagcaggg tcttctgctc cagtcacccc ggcctcttgc aagagaaagt 
2761 ccagggactg gcaagaagac cctttgacta aagataatgt gctgtcatat cttggctgag 
2821 aaactgttga gtcaggctag aggactggac aagagaagag gcattgagag cccccttggg 
2881 agccattagc cttgctctgg tctgctgtgt gggttgggga gtagggggga tcccacattc 
2941 cctggacatc tgaatgctga tacgaatgga caggaaaggg agccagagct gtatacctgg 
3001 gacaggccag ggagccttta gtgccagcag ggcctgtgtc ctgggatgtg atgctagtga 
3061 gtgaatggga gggcgggcct ggagctgcgt gagagtgaaa ccaagaagcg gatggggaag 
3121 ggagagctgg ccttggtcag cttagggatg aatggagaag gaatcattca tgggcctcgg 
3181 ctaggtccca gactgattac atgagactgg ggagtccttc cctgccagtt ttctcattta 
3241 ttacagccta tcccccaacc ctacccctgg gccctgccta cacactcccg tttggccctc 
3301 agcactttgg gcctggtcac ataccccatg ccctagagtg ctcttttcag ctggggaggg 
3361 gaagagtgct ggctcctttc tagacatgct tatgtataag taaaggggca ggtgcttggg 
3421 ctccagagag accctaggta ctgccttcta actcctgc^at cctcagaagg ggaaaaggga 
3481 ctgggaagta aagggaaacc atatggcccc agggaatgga cccttggaga ctcccatcct 
3541 ctcttccccc tcaccaagtg cccaggagac ccctggctca gaccagccca aggccttgcc 
3601 cagtggcagg ggaaaggggc acctcactcc, acctcaaagt catttgactg cccccatcca 
3661 ccccaccacc ctaatcttcc accacctctc cggctgcccc tgcctcccat cacagacagc 
3721 agagccgggc agctttctta tgccattttc tacactgtgc ttcatgagta ggactttctt 
3781 gcactagttc ctatgactga gtctccaaac tggtttccta gtagtccccc atcccttcct 
3841 cccttaccca gctatgattc agttgtctct gccctccctc ttaccctgcc tctgtgtttc 
3901 ggtgagagtc tctgtatatg tactgctttc tgttttgttg cattgtgggg cagaggcctc 
3961 tctgctcttg tttaacccca taaagaaata aatggtaaga cttgatgata aaaaaaaaaa 
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X-linked Hypohidrotic Ectodermal Dysplasia: Localization 
within the Region Xql I-2LI by Linkage Analysis and 
Implications for Carrier Detection and Prenatal Diagnosis 
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Summary 

X-linked hypohidrotic ectodermal dysplasia (H.E.D.) is a disorder of abnormal morphogenesis of ectoder- 
mal structures and is of unknown pathogenesis. Neither relatively accurate carrier detection nor prenatal 
diagnosis has been available. Previous localization of the disorder by linkage analysis utilizing restriction- 
fragment polymorphisms, by our group and others, has placed the disorder in the general pericentromeric 
region. We have extended our previous study by analyzing 36 families by means of 10 DNA probes at nine 
marker loci and have localized the disorder to the region Xqll-Xq21.1, probably Xql2-Xql3. Three 
loci— DXS159 (e - .01, z - 14.84), PGKl (G = .02, z = 13.44), and DXS72 (0 = .02, z = 11.38)— 
show very close linkage to the disorder, while five other pericentromeric loci (DXS146, DXS14, DXYSl, 
DXYS2, and DXS3) display significant but looser linkage. Analysis of the linkage data yields no significant 
evidence for nonallelic heterogeneity for the X-linked form of the disorder. Both multipoint analysis and 
examination of multiply informative meioses with known phase establish that the locus for H.E.D. is 
flanked on one side by the proximal long arm loci DXYSl, DXYS2, and DXS3 and on the other side by 
the short arm loci DXS146 and DXS14. Multipoint mapping could not resolve the order of H.E.D. and the 
three tightly linked loci. This order can be inferred from published data on physical mapping of marker loci 
in the pericentromeric region, which have utilized somatic cell hybrid lines established from a female with 
severe manifestations of H.E.D., and an X/9 translocation (breakpoint Xql3.1). If one assumes that the 
breakpoint of the translocation is within the locus for H.E.D. and that there has not been a rearrangement 
in the hybrid line, then DXS159 would be proximal to the disorder and PGKl and DXS72 would be distal 
to the disorder. Both accurate carrier detection and prenatal diagnosis are now feasible in a majority of 
famihes at risk for the disorder. 



Introduction 

X-linked hypohidrotic ectodermal dysplasia (H.E.D.) 
involves developmental defects in a number of ecto- 
dermal derivatives, including teeth, hair, and eccrine 
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sweat glands, and can cause significant early morbid- 
ity and mortality (Freire-Maia and Pinheiro 1984; 
Clarke et al. 1987^7). The exact frequency of the dis- 
order is unknown, but a birth prevalence of 1/ 
100,000 males and a mutation rate of 1-5 x 10"^ 
have been estimated (Stevenson and Kerr 1967). 
Heterozygous females manifest overt signs of the dis- 
order in only 70% of cases, and this has made accu- 
rate carrier detection in at-risk females problematic 
(Pinheiro et al. 1981). Analyses of sweat-pore num- 
bers and distribution and radiographic measurement 
of tooth size in at-risk females can aid in carrier 
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identification, but the study of sweat pores, despite 
various proposed methods, has proved quire variable 
and is subject to potential misclassification (Frias and 
Smith 1968; Nakata et al. 1980; Kleinebrecht et al. 
1981; Happle and Frosch 1985). Prenatal diagnosis 
of the disorder has been reported once utilizing a skin 
biopsy obtained by fetoscopy, but the accuracy of 
this procedure has not been validated; nor is the pro- 
cedure widely available (Arnold et al. 1984). 

Recently, three groups of investigators confirmed a 
broad pericentromeric localization for the disorder, 
but without tight linkage to any single marker (Kol- 
vraa et al. 1986; MacDermot et al, 1986; Clarke et 
al. 19876). To further localize the disorder and ex- 
amine the question of nonallelic genetic heteroge- 
neity, the present study undertook to extend our pre- 
vious work by the incorporation of 12 new families, 
nine American and three British, to give a total of 36 
families. Seven new DNA probes were used from a 
total of nine marker loci analyzed. 



Material and Methods 

Family Studies 

The 26 British families were ascertained by one of 
the authors (A.C.) writing to regional genetic centers 
and to medical and dental specialists. Consent to con- 
tact the families was obtained from the medical or 
dental practitioners involved. Samples and clinical in- 
formation on one Swiss family were provided by Al- 
bert Schinzel. 

Nine American families were ascertained through 
the cooperation of the National Foundation for Ec- 
todermal Dysplasias. These families either had previ- 
ously agreed to be contacted for research purposes or 
had read about the study in the foundation's newslet- 
ter. Informed consent for blood sampling and partici- 
pation in the study was obtained in accordance with 
guidelines approved by the Human Research Review 
Board of the Oregon Health Sciences University. 

Affected males and possible carrier females in all 
of the British families were personally examined by 
one of the authors (A.C), and affected males had the 
typical findings of hypodontia, hypotrichosis, and hy- 
pohidrosis (Clarke et al. 1987^^ Affected males in 
five of the nine American families were personally 
examined by one of us (J.Z.), and confirmation of the 
diagnosis in the others was obtained by both personal 
telephone interviews and, when necessary, review of 
clinical records. Potential carrier females in the 



American families were also personally inter\'iewed 
by one of the authors as to manifestations of ectoder- 
mal dysplasia, but personal examination by one of us 
was not always possible. 

Women were scored as heterozygous for H.E.D. 
either on genetic grounds (obligate carriers) or by the 
presence of abnormal dentition (manifesting car- 
riers). Strict minimal criteria of dental abnormalities 
were seleaed for classification as a carrier and re- 
quired the absence of tw^o or more deciduous or per- 
manent teeth (excluding third molars), absence of 
two caniniform deciduous or permanent incisors, or 
marked microdonria of at least four teeth of either 
dentition (Nakata et al. 1980). In a previous study 
urilizing the same minimal criteria for dental abnor- 
malities, 78% of the Brirish obligate carrier females 
had abnormal dentition (Clarke et al. 1987fl). Sys- 
tematic measurement of tooth size on dental radio- 
graphs was not performed. Many of the women who 
fit the dental criteria also had additional physical 
manifestations of H.E.D. , such as thin scalp hair or 
patchy body hair, but these findings were not utilized 
since they were difficult to quanrify. 

Although examination of the pattern of sweating 
on the whole back for evidence of Lyonization had 
been performed previously on a majority of the possi- 
ble British carriers, neither these data nor palmar 
sweat-pore counts were utilized in classification 
(Clarke et al. 1987^?). Clarke et al. (1987^) had found 
that 97% of both the obligate and manifesting car- 
riers (as classified by dental criteria) displayed abnor- 
mal sweat patterns. However, these tests were ex- 
cluded to minimize the possibility of misclassification 
due to subjective interpretation. At-risk females 
without abnormal demition were classified as possi- 
ble carriers and were not used in the linkage analysis. 

Southern Blot Analysis 

High-molecular-weight DNA was extracted, by 
means of published procedures, from blood previ- 
ously stored at -20 C or -70 C (Kunkel et al. 
1977). DNA samples (3-5 ^g) were digested to com- 
pletion for 6-24 h by using 2-4 units of the relevant 
restriction endonucleases/^JLg DNA and were then 
separated by electrophoresis on 0.7%-0.9% agarose 
gels at 1.3-1.6 V/cm for 18-24 h. Transfer was then 
performed to nylon filters, either Hy-Bond® N 
(Amersham) or GeneScreen Plus® (duPont), accord- 
ing to standard manufacturer's procedures. 

Hybridization of the filters with ^32-labeIed probes 
was performed at 65 C for 24 h in 6 x SSC, 5 x 
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jnhardt's solution, 0.5%-l% SDS, 7%-10% dex- 
jn sulfate, and 20 |jLg herring-sperm DNA/ml. 
-> jjfjA probes were labeled by either nick-translation, 
using a commercial kit (Amersham), or by random 
* oligomer extension to give specific activities of 10^- 
2q9 cpmVg DNA (Feinberg and Vogelstein 1983). 
Posthybridization washes were at 65 C, twice at 2 x 
SSC and 0.1% SDS for 10-20 min and subsequently 
I at increasing stringencies (l%"0.17o SSC with 0.1% 
$ SDS), depending on the probe. Autoradiography 
I onto Fuji X-ray film was performed at -70 C with 
I an intensifying screen for 1-7 days. 
f Some initial difficulties were encountered with the 
t use of the phosphoglycerate kinase 1 (PGKl) cDNA 
- probe pHPGK-7e, which detects either a 6-kb or both 
a 2-kb and a 4-kb fragment on Pstl digestion (Hutz et 
I al. 19S4). PGKl is part of a multigene family, and an 
, autosomal band overlays the 6-kb X-derived frag- 
ment, which necessitates a double digest with Xbal 
5 and Pstl to selectively remove the autosomal frag- 
' ment. Despite this treatment a doublet of similarly 
-f^ sized fragments persisted at around 6 kb in a few 
individuals, but the slightly larger autosomal frag- 
ment could be either removed or clearly distinguished 
from the X-linked band following a longer period (24 
h) of enzymatic digestion and electrophoresis for 24 h 
on a 0.7% agarose gel. The typing of all apparent 
heterozygous females was repeated using this second 
method to ensure that no homozygous females had 
been mistyped. Subsequent to these studies we ob- 
tained the genomic pSPT/PGK probe containing the 
5'-flanking region of the PGKl locus, which is also 
polymorphic {Bgll} but does not have any overlying 
autosomal fragments (Keith et al. 1986; Vogelstein et 
al. 1987). Unfortunately the genomic Bgll polymor- 
phism was in complete linkage disequilibrium with 
the cDNA Pstl polymorphism (96 individuals tested), 
but it was simpler to use than the cDNA probe. 

Linkage Analysis 

The nine pericentromeric RFLP loci used in this 
study included three loci located on Xp (DXS7, 
DXS146, and DXS14) and six loci located on Xq 
(DXS159, PGKl, DXS72, DXYSl, DXYS2, and 
DXS3). The DXS7 locus was not tested on the nine 
American families, since no significant linkage was 
apparent after analysis of the British families. Ten 
t^NA probes from these loci were utilized and are 
listed in table 1, which also includes information on 
probe localization, restriction enzymes utilized, and 
allele size and frequencies. Allele frequencies were 
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derived from our 36 families and were calculated 
from 93-112 unrelated X chromosomes. Two-point 
linkage analyses to determine the maximum-like- 
lihood recombination distances (O^ax) and the lod 
scores at G^ax iz) between the main locus (HED) and 
the DNA probe loci were carried out using mlink 
from the linkage package (version 3.5) (Lathrop et 
al. 1984). The confidence intervals (CI) were cal- 
culated as all values of 0 for which the lod score was 
within 1 unit of the maximum (Conneally et al. 
1985). Two-point analyses were initially carried out 
utilizing data on all heterozygous females, including 
manifesting females, and on normal and affected 
males. The analyses were then repeated with a subset 
of data utilizing only the offspring of the 49 hetero- 
zygous females who were obligate carriers, both 
possible and manifesting carriers being excluded, 
to test whether there had been any systematic 
misclassification of females as heterozygotes on the 
basis of physical examination alone. 

Multipoint analyses were performed for HED and 
the marker loci by using ilink (Lathrop et al. 1984). 
The analyses utilized the full set of data, including all 
heterozygous females. We utilized a fixed order for 
marker loci whenever such an order was clearly es- 
tablished in the literature (e.g., DXS7-DXS14- 
DXYSl). We then performed a series of four-point 
analyses utilizing marker loci alone to determine, 
when possible, the position of the other marker loci. 
Subsequent to this, a series of four-point analyses 
were performed utilizing the HED locus and different 
combinations of three marker loci. In all the analyses 
the relative likelihoods (odds) and probabilities of the 
alternative orders were calculated. 

The question of possible heterogeneity was ad- 
dressed by utilization of two programs, homog and 
HOMOG2 (1987 version), developed by J. Ott to test 
for homogeneity (admixture) (Ott 1983, 1985). The 
first program, homog (test I), carries out a ho- 
mogeneity test under the following alternative hy- 
pothesis: two family types, one with linkage between 
HED and the marker locus and the other without 
Hnkage. The second program, homog2 (test II), is an 
extension of the homogeneity test, with the following 
alternative hypothesis: two family types, both with 
linkage and one with theta 1 and the other with theta 
2, where theta 1 < theta 2 < .5. In both of these 
programs the tests of one hypothesis against another 
are carried out as likelihood-ratio tests of het- 
erogeneity (H2) versus homogeneit)^ (HI), with the P 
values calculated from the asymptotic x" distribution 
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Table I 
RFLP Loci 



Locus (Probe), Location, and Enzyme 

DXS7 (Ll,28) Xpn.3: 
Taql 

DXS146 (pTAK8a) Xpll-ql2: 
Xbal 

DXS14 (p58.1) Xpll-cen: 
Mspl 

DXS159 {cpX73) Xqll-ql2: 
Pstl 

PGKl (pHPGK-7e) Xql3: 

PstVXbal 

Pstl ......[ 

PGKl (pSPT/PGK) Xql3: 

Bgll 

DXS72 (pX65H7) Xq2l.l: 
Hwdlll 

DXYSl (pDP34) Xq21.31: 
Taql 

DXYS2 (7b) Xql3-Xq22: 
Pstl 

DXS3 (pl9.2) Xq21.3-q22: 

Taql 



Alleles 




Source 


(kb) 


Frequencies 


(Reference) 


12.0 


.74 


Pearson 


9.0 


.26 


(Wieacker et al. 1984) 


5.0 


.40 


Kruse 


3.3 


.60 


(Kolvraa et al. 1986) 


4.0 


.75 


Latt 


2.5 


.25 


(Bruns et al. 1984) 


5.5 


.59 


Pearson 


1.6 


.41 


(Arveiler et al. 1987fl) 


6.0 


.21 


Orkin 


4.0/2.0 


.79 


(Hutz et al. 1984) 


12.0 


.30^ 


Riggs 


5.0 


.70 


{Vogelstein et al. 1987) 


7.2 


.54 


Schmcckpeper 


.7 


.46 


(Schmeckpeper et al. 1985) 


12.0 


.35 


Page 


11.0 


.65 


(Page et ai. 1982) 


12.0 


.29 


Weissenbach 


9.0 


.71 


(Koenig et al. 1985) 


5.1 


.38 


Bruns 


2.95/2.15 


.62 


(Aldridge et al. 1984) 



of the likelihood-ratio statistic. One should note that 
the tests of homogeneity are one sided and that the P 
values are one-half those associated with x^. These 
analyses were carried out between HED and all nine 
marker loci. The input to the programs consisted of 
lod scores calculated for the values of theta ranging 
from .0 to ,49 (50 points). 

Results 

The results of the two-point linkage analyses for 
the full set of data, including all of the heterozygous 
females, are displayed in table 2. In table 3, analyses 
of the subset of data utilizing obligate carrier females 
only (method B) are compared with the first set of 
analyses (method A). Comparison of the results from 
the full set of data and from the subset of data shows 



some variation in the 0^^^ and z^ax between HED and 
each loci, but the e^^^ values of each are close to oiie 
another and well within the CI of the other analysis. 
This result would generally validate our classification 
methods in both the British and American families. 

Three marker loci localized to the Xql 1-21.1 re- 
gion demonstrated very close linkage with the HED 
locus (table 2). The DXS159 locus (Xqll-ql2) had 
only one phase-unknown recombinant, producing a z 
of 14.84 at a e of .01, The single recombinant indi- 
vidual was an edentulous 60-year-old woman whore 
classification as a heterozygous carrier is uncertain, 
since it is based on a history of two missing perma- 
nent teeth (excluding the third molars), and her den- 
tal records were not available for review. Physical 
examinations of her and of her three offspring (two 
male, 1 female) were otherwise normal. Since in vari- 
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Table 2 

Two-Point Linkage of HED and Marker Loci 



No. OF Recombinant 
Meioses/Total No. 

of Meioses for Phase 6 (in cM) at a Recombination Fraction of 



Locus 


Known 


Unknown 


.01 


.02 


.03 


.04 


.05 


.10 


.20 


.30 


.40 


6max 




DXS7 .... 


7/21 


6/25 


-14.59 


-10.56 


-8.28 


-6.71 


-5.54 


-2.25 


.09 


.63 


.46 


.31 


.64 


DXS146 .. 


. 5/16 


4/41 


-2.12 


.33 


1.64 


2.50 


3.10 


4.42 


4.24 


2.90 


1.30 


.14 


4.59 


DXS14 ... 


. 1/13 


2/34 


8.10 


8.73 


8.98 


9.08 


9.09 


8.55 


6.43 


3.83 


1.28 


.05 


9.09 


DXS159 .. 


. 0/35 


1/33 


14.84 


14.84 


14.70 


14.52 


14.30 


12.99 


9.93 


6.58 


3.19 


.01 


14.84 


PGKl .... 


0/23 


1/33 


13.43 


13.44 


13.33 


13.16 


12.96 


11.76 


8.94 


5.90 


2.85 


.02 


13.44 


DXS72 ... 


. 0/17 


1/28 


11.34 


11.38 


11.28 


11.13 


10.96 


9.87 


7.36 


4.74 


2.22 


.02 


11.38 


DXYSl ... 


. 1/18 


2/31 


6.95 


7.60 


7.87 


7.99 


8.02 


7.62 


5.86 


3.72 


1.64 


,05 


8.02 


DXYS2 - . . 


. 2/24 


6/36 


1.20 


3.31 


4.43 


5.13 


5.60 


6.48 


5.69 


3.76 


1.55 


.11 


6.50 


DXS3 


8/34 


6/45 


-9.42 


-4.95 


-2.49 


-.85 


.34 


3.34 


4.41 


3.35 


1.50 


.18 


4.44 



ous surveys l%-7% of the normal population have 
been found to be missing permanent teeth other than 
the third molars, the possibility exists that this 
woman is not heterozygous for H.E.D. (Bailit 1975; 
Nakata et al. 1980). This individual was also the only 
recombinant between the disease and the PGKl lo- 
cus. One cannot infer gene order from this meiosis, 
since it is a phase-unknown situation, and her carrier 
status is not certain. 

Only one phase-unknown recombinant (one of the 
two sons of an obligate-carrier female) was present 
between HED and the DXS72 locus (XqZl.l), yield- 
ing a 0 of .02 with a £ of 1 1.38. This meiosis was also 
recombinant between HED and the DXYS2 locus. 



The PGKl locus, localized to Xql3, was assessed by 
the use of both a cDNA probe, pHPGK-7e, and a 
genomic probe (pSPT/PGK) from the 5 '-flanking re- 
gion of the gene. A 6 of .02 with a f of 13.44 was 
found, with only one phase-unknown recombinant 
(discussed above). 

On two-point analysis, five other pericentromeric 
marker loci (DXS146, DXS14, DXYSl, DXYS2, and 
DXS3) showed significant but looser linkage to HED, 
with G's of 5-18 cM and f s of 4.44-9.09. Only the 
DXS7 locus showed no significant linkage to HED. 
Examination of multiply informative meioses (phase 
known) suggested that the DXYSl, DXYS2, and 
DXS3 loci are all distal to HED, while the short arm 



Table 3 

Two-Point Analysis of HED versus Marker Lx>ci 



Method B'^ 



Locus 




Method A-" 










No. of 
Informative 
Meioses 


DXS7 


. . .31 


.639 




.21 


.96 




26 


DXS146 


.14 


4.593 


.06-.26 


.11 


3.127 


.03-.27 


33 


DXSt4 


. . .05 


9.087 


<.01-.13 


.00 


8.068 


.00-.06 


32 


DXS159 


. . .01 


14.844 


<.01-.O7 


.00 


9.648 


.00-.05 


39 


PGKl 


, .02 


13.442 


<.01-.07 


.00 


10.156 


.00-.05 


39 


DXS72 


.02 


11.375 


<.01-.08 


.04 


4.163 


<.01-.18 


22 


DXYSl 


, , .05 


8.023 


.01-.14 


.07 


3.391 


<.01-.22 


23 


DXYS2 .... 




6.504 


.05-.21 


.13 


2.233 


.04-.31 


24 


DXS3 




4.439 


.10-,29 


.15 


3.455 


.06-.29 


43 



* Utilizing all heterozygous females. 
Utilizing only obligate-carrier heterozygous females. 
= — 1 Led score. 



80 



Zonana et al. 



Table 4 

Muldpotnt Analyses 



Esrimated 9 

benveen Loci R^Ur;..« n l l t- 

Order of Loci (r\A\ _ f . ^^'^'"^ Probability 

(%) 



DXS72-DXS14-HED-DXYS1 2 0-3 6 50 

DXS14-HED-DXYS1-DXS72 4.4-3.6-2.'l 



DXS146-DXS159-HED-DXYS1 12 8-16-5 0 

DXS146-HED-DXS159-DXYS1 13.8-l.*l-4.7 



DXS146-HED-DXYS1-DXS159 13.8-3.4-3.7 



i^^) -Ln(L) (L) 

A: 

DXS7-DXS146-DXS14-PGK1 175-11 5-3 5 ^7S 4i 

DXS146-DXS7-DXS14-PGK1 133- 75-3 1 ' ^"-^^ 

^DXS7-DXS14-DXS146-PCK1 ^ "^l^^^l J-^J 5.45 

DXS146-DXS14-HED-DXYS1 9.7-4 4-5 ^ Sl«? w 

^DXS146-HED.DXS14.DXYS1 13.8-3.4-4:6 In.V, '^f, 

DXS14-PGK1-HED-DXYS1 --5-1 3 5 5 -g, „ 

DXS14-HED-PGK1-DXYS1 4 6. 4 4 3 -ll f -^^'-'^ ^' ^^ 

HED-DXS14-PGKI-DXYS1 tVijil mfs 

^DXSH-PGKl-DXYSl-HED 3...2.,.3.2 sllfo 'J.o'o .o'j 

DXS14.PGK1-HED-DXYS2 2.5-1.3-11 



94.51 
4,15 



DXS14-HED-PGK1-DXYS2 45. 5- 0 3 9 S 

HED-DXS14-PGK1-DXYS2 3 1 491 -HH '^^^^''^ 

DXS14-PGK1-DXYS2.HED ... f [Itfi ''^^ 

E: " J.b-/.6 806.87 i.qO .00 

DXS14-PGK1-HED-DXS3 2 4 1 7 1/: Q o^, oo 

DXS14.HED.PGK1.DXS3 4 1 t 5 0 lll 'l !oc'''°' ° 

HED.DXSH.PGK1-DXS3 :::::::: Il.Jiu:; 3'- ^ ^^^^ 



6.39 



^DxsM.PCK,.oxs.HEo ::::::::::::::::: -^-j^ 

DXS14-DXS72-HED-DXYS1 3 5-1 6-5 0 «n7 is 

pXS14-HED.DXS72-DXYSl lUji 27 ^ ^i'!^ 



807.38 53.20 46.47 

810.81 1.71 



G: 4.4-3.6-2.1 811.35 i.oo .87 



827.82 4138.13 57.85 



DXS159-DXS146-HED-DXYS1 1 5.7750 IWlt • '''' 



10.89 .15 
S36A5 1.00 .01 



loci, DXS14 and DXS146, flanked the disorder on 
the other side. 

Multipoint analyses, comparing the relative likeli- 
hoods (odds) and probabilities of the alternative or- 
ders of the loci, confirmed that HED is proximal to 
three Xq marker loci (DXYSl, DXYS2, and DXS3) 
and is bracketed by DXS146 and DXS14, which are 
located on proximal Xp (table 4). We first established 
that the DXS146 locus was distal to DXS14 (table 
4A). Subsequent analysis demonstrated that both 
HED and PGKl were between the DXS14 locus and 
the three long arm markers, DXYSl, DXYS2, and 
DXS3 (of the 12 orders calculated, only the four with 
the greatest relative likelihoods are shown; table 4B- 
4E). It was impossible to definitely resolve the order 



of HED and the three tightly linked loci PGKl 
DXS159, and DXS72 (table 4C-4G). There were no 
recombinants between these three marker loci, and 
there were only single recombinants (phase un- 
known) between HED and each of these loci. There is 
a suggestion in the analysis that HED is distal to 
PGKl, with the relative odds ratios between the alter- 
native orders in the range of 5-23:1, but the onlv 
multiply informative recombinant in the analysis is 
from the female whose carrier status is uncertain. The 
analysis revealed that the DXS72 locus is between 
DXS14 and DXYSl and that DXS159 is between 
DXS146 and DXYSl (table 4F, 4G). There were no 
recombinants between the DXS14 and DXS159 loci. 
Estimates derived from multipoint analyses of the re- 
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Table 5 

Coupling of HED and Marker Loci for Three Tightly 
Unked Loci ^ 

Locus and Allele 

(kb) Observed Expected^ P 



DXS159: 




20 




25 




9 


14 


PCKl: 








9 


7 


4,0/2.0 


... 26 


28 


DXS72: 








19 


19 




16 


16 


* Derived from study sample. 



two types of families, one with linkage and one with- 
out, five of the nine markers had a value of 0.000, 
with the maximum possible probability of homo- 
geneity (.5). The second method (test II), which as- 
sumes two types of families, both with linkage but 
with different values of theta, also yields generally 
similar results. The only exception to these results is 
seen in the data for the DXS3 locus. This exception 
may be due to the multiple comparisons performed in 
the analyses, possible heterogeneity of the position of 
the marker locus itself, or incorrect typing of this 
marker locus in a few families. The families that had 
a low posterior probability of linkage between HED 
and the DXS3 locus were informative for other 
marker loci and had a high posterior probability of 
linkage between HED and these loci. 



combination distances between HED and the linked 
marker loci were within 1 cM of those calculated by 
two-point analyses. 

Since three of the markers demonstrated tight link- 
age with HED, we examined the 36 families for evi- 
dence of linkage disequilibrium (table 5). No clear 
evidence for disequilibrium was present for any of the 
markers, which is consistent with the impression of 
a moderately high rate of new mutation among 
families with the disorder. 

Results of the tests for heterogeneity between HED 
and the nine marker loci showed little evidence for 
significant heterogeneity with either of the two 
methods utilized for eight of the nine marker loci 
(table 6). In the first method (test I), which assumes 



Discussion 

The first clue to the regional localization of H.E.D. 
on the X chromosome was the description of a female 
with severe manifestations of H.E.D., a case reported 
verbally at the first Human Gene Mapping confer- 
ence by the late Dr. Peter Cook but not documented 
in the literature (Gerald and Brown 1974). The pa- 
tient had previously been karyotyped for failure to 
thrive at 6 mo of age by other investigators, without 
recognition of her ectodermal dysplasia, and had a 
balanced X/9 translocation, with the breakpoint 
placed at Xql2 (Cohen et al. 1972; Shows and 
Brown 1974). We have had the opportunity to review 
the past records of this patient and have confirmed 



Table 6 

Tests for Heterogeneity between HED and Nine Marker Loci 



Test I Test II 



Marker Locus 


1 

X" 








(No. of Informative Families) 


(df = 1) 


P 


(df - 2) 


P 


DXS7(19) 


.061 


.4026 


.119 


.4712 


DXS146 (26) 


.000 


.5000 


.002 


.4995 


DXSI4 (22) 


.457 


•2495 


.457 


.3978 


DXS159 (25) 


.000 


.5000 


.104 


.4747 


PGKl (22) 


.000 


.5000 


.003 


.4796 


DXS72 (27) 




.4183 


.136 


.4671 


DXYSl (23) 




.5000 


.000 


.5000 


DXYS2(20) 


.000 


.5000 


.000 


.5000 


DXS3 (23) 


4.661 


.0154 


6.347 


.0209 



Note. — Tests of heterogeneity assume that there are two types of families, as follows: for test I, one type is linked and the other type is 
unlinked (theta == .5); for test II, both types have linkage, one rvpe having theta 1 and the other type having theta 2, where theta 1 < theta 2 
<.50. 
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that she had r>^pical manifestations of a male with 
H.E.D. (K. D. MacDermot, personal communica- 
tion); we have also recently obtained a fibroblast cell 
hne of this patient from the National Institute of 
General Medical Sciences' Human Genetic Mu- 
tant Cell Repository (GM0705) and, using high- 
resolution techniques, found the breakpoint actu- 
ally to be at the junction of the Xql2 and Xql3.1 
bands, with a karyotype of 46,X,t(X;9) (Xpter-^ 
Xql3.1::9p24-*9pter; 9qter-^9p24::Xql3.1->Xqter) 
(Zonana et al., in press). Her cell line, also referred to 
as the AnLy line, has been used extensively by a num- 
ber of investigators in the construction of somatic cell 
hybrids for gene mapping purposes, apparently with- 
out realization of the patient's full disease phenotype 
(Shows and Brown 1974, 1975; Grzeschik and Sinis- 
calco 1976; Wieacker et al. 1984; Oberle et al. 1986; 
Riddell et al. 1986; Arveiler et al. 1987fe). 

Another clue to the localization of the disorder is 
the existence of a similar X-linked disorder, the 
Tabby mutant, in the mouse (Blecher 1986). This 
mutant locus has been mapped close to the PGK-1 
locus (Searle et al. 1987). If the Tabby locus is genet- 
ically homologous to the human X-linked HED 
locus, one would predict, on the basis of man-mouse 
X-chromosome homology, that the gene might be 
located on the proximal long arm near the PGKl 
locus (Ohno 1973). 

In 1986 MacDermot et al. suggested linkage of 
HED to the DXYSl locus, with a 0 of .06, and a z of 
2.7. Our present and past studies confirm linkage to 
the DXYSl locus at a distance of 5 cM and demon- 
strate by multipoint analysis that this locus, as well as 
DXYS2 and DXS3, are all distal to HED (table 4). 
This analysis is consistent with reports of two males 
with both choroideremia, but without H.E.D., and 
interstitial deletions within the region Xql3-q22, in 
which one male was deleted for both the DXYSl and 
DXS3 loci and the other for DXYSl, DXYS23 
DXYS5, DXYS12, DXYS2, and DXS3 loci (Hodgson 
et al. 1987; Schwartz et al., in press). Kolvraa et al. 
(1986) previously suggested close linkage of the dis- 
order to the DXS146 locus (Xpll-XqU), with a 9 of 
.0, and a I of 2.41, but our present data indicate 
hnkage at a distance of 14 cM, with the locus located 
distal to DXS14. We had previously demonstrated 
hnkage of the HED locus to the DXS14 and DXS3 
loci and suggested an order of DXS14-HED-DXS3, 
and this has been confirmed by our expanded data 
set, with only minor modifications of genetic dis- 
tances (Clarke et al. 19876). 
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The tightest linkage to HED was demonstrated 
with three marker loci (DXS159, PGKl, and DXS72) 
from the subregion Xqll.Xq21.1. None of these loci 
were deleted in any of our affected males, nor was 
clear linkage disequilibrium apparent (table 5). The 
lack of phase-known recombinants between HED 
and the DXS159 and PGKl loci does not allow us to 
order the loci by linkage analysis. The third locu^ 
closely linked to HED is DXS72. The data suggest 
that this locus is distal to HED and proximal to 
DXYSl, on the basis of a doubly informative meiosis 
recombinant between HED and DXS72 and DXYS2 
and on the basis of another informative meiosis re- 
combinant between DXYSl and both HED and 
DXS72. Although these are phase-unknown meioses, 
the suggested order agrees with information from a 
patient reported with both choroideremia without ec- 
todermal dysplasia and an Xq21.1 interstitial dele- 
tion, deleted for both DXS72 and DXYSl (Lesko 
et al. 1987). DXS72 has recently been assigned, 
by deletion mapping, to Xq21.1, proximal to 
DXYSl, which was placed at Xq21.3 (Yang et al. 
1987). ^ 

The following most likely order of loci can be pro- 
posed on the basis of our linkage data: DXS7 
DXS146 - DXS14 - (DXS159 / HED / PGKl / DXS72i 
(DXYS1/DXYS2/DXS3). A major difficulty in deter- 
mining the exact gene order for HED and the closely 
linked marker loci is the relative lack of genetic 
recombination in the pericentromeric region of the 
X chromosome. Although one may have close ge- 
netic linkage, there may still be a considerable physi- 
cal distance between loci. Physical mapping of 
marker and disease loci will be necessary but has 
been hindered by the lack of any H.E.D. patients 
identified with either cytogenetic or probe deletions. 
However, one can utilize information from published 
somatic cell hybrid studies, which utilized the X- 
autosome translocation from the manifesting female 
with H.E.D. (the AnLy fibroblast line) to attempt to 
order the closely linked marker loci and HED. 

Shows and Brown (1975) first showed, in a mouse 
hybrid, that the PGKl locus mapped distal to the 
AnLy Xql3.1 breakpoint, and this was subsequendy 
confirmed on an independendy derived hybrid line 
(Grzeschik and Siniscalco 1976). If the translocation 
breakpoint is actually located within the disease 
locus, as has been demonstrated in several mani- 
fesring female translocation carriers with Duchenne 
niuscular dystrophy, then, barring a significant 
chromosome rearrangement in the somatic cell lines, 
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we would suggest that the PGKl locus is distal to 
HED (Koenig et al. 1987; Boyd et al., in press). Al- 
though the multipoint mapping weakly suggests that 
HED may be distal to the PGKl locus, this inference 
is based on a minimal number of multiply informa- 
tive recombinants, including one possible recombi- 
nant whose carrier status is questionable, and this 
inference is probably less reliable than the physical 
mapping data. Recenriy, several investigators have 
used panels of X-chromosome deletions and duplica- 
tions in somatic cell hybrids to physically map many 
marker loci to the pericentromeric region (Oberle et 
al. 1986; Arveiler et al. 1987fo; F. P. M. Cremers, 
personal communication). One of the somatic cell 
lines used by each group to delimit an Xql2-ql3 
breakpoint was originally constructed from the AnLy 
fibroblast hne. Their combined data would place the 
DXS159 locus proximal to the Xql3.1 breakpoint 
and would place PGKl and DXS72 distal to the 
breakpoint and suggests the following order: 
DXS159-HED.PGK1-DXS72. 

Finally, interfamilial clinical heterogeneity has 
been observed in X-linked H.E.D., and such variation 
has also been observed in our study (Freire-Maia and 
Pinheiro 1984; Clarke et al. 1987a). The question has 
been posed whether this is due to allelic genetic 
heterogeneity at a single X locus or to multiple X loci. 
In our 36 families the results of the tests for 
heterogeneity between HED and eight of the nine 
marker loci would indicate that H.E.D. is usually 
produced by a mutation at a single genetic locus and 
that interfamilial variability is due to different allelic 
mutations. This information is critical to the clinical 
application of linkage analysis for both carrier detec- 
tion and prenatal diagnosis. 

Eighty-one percent of females heterozygous for 
H.E.D. in our study were informative for one of the 
three marker loci mapped to within 2 cM of the dis- 
order. If one assumes that the AnLy translocation 
places the disorder between DXS159 and PGKl, then 
32% of the carriers have both proximal and distal 
informative flanking markers within 2 cM of the dis- 
order. It is hoped that further genetic and physical 
mapping of the disorder will confirm this critical gene 
order. Linkage analysis of families, with the three 
closely linked marker loci (DXS159, PGKl, DXS72), 
combined, if necf ssary, with analysis of the five addi- 
tional linked loci, should provide both greatly im- 
proved carrier detection for most women at risk for 
H.E.D. and accurate prenatal diagnosis for many 
known carriers of the disorder. 
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Human ^enes containing triplet repeats may mark- 
edly expand in length and cause neuropsychiatric dis- 
ease, explaining the phenomenon of anticipation (in- 
creasing severity or earlier age of onset in successive 
generations in a pedigree). To identify novel genes with 
triplet repeats, we screened a human brain cDNA li- 
brary with oligonucleotide probes containing CTG or 
CCG triplet repeats. Fourteen of 40 clones encoded 
novel human genes, and 8 of these inserts have been 
sequenced on both strands. All contain repeats, and 5 of 
the 8 have 9 or more consecutive perfect repeats. All 
are expressed in brain. Chromosomal assignments re- 
veal a distribution of these genes on multiple autosomes 
and the X-chromosome. Further, the repeat length in 
two of the genes is highly polymorphic, making them 
valuable index linkage markers. We predict that many 
triplet repeat-containing genes exist; screening with 
the CTG probe suggests approximately 50—100 genes 
containing this type of repeat are expressed in the hu- 
man brain. Since additional disorders, such as Hun- 
tington's disease, bipolar affective disorder, and possi- 
bly others, show features of anticipation, we suggest 
that these novel human genes with triplet repeats are 
candidates for causing neuropsychiatric diseases. 

© 1993 Academic Press, Inc. 



INTRODUCTION 

Three neuropsychiatric disorders are caused by ex- 
pansion of unstable triplet repeats in human genes (Gas- 
key et ai, 1992; Davies, 1992; Richards and Sutherland, 
1992). Myotonic dystrophy is an autosomal dominant 
disorder characterized primarily by myotonia and pro- 
gressive muscle weakness. Expansion of CTG triplet re- 
peats in a gene called myotonin protein kinase underlies 
expression of the disease (Mahadevan et ai, 1992; Brook 
et ai, 1992; Harley et al., 1992a). Normals have a range of 
5-30 repeats, whereas patients with myotonic dystrophy 

' To whom correspondence and reprint requests should be ad- 
dressed. Telephone: (410) 614-0011. Fax: (410) 614-0013. 



often have many hundreds of repeats. The degree of ex- 
pansion correlates with severity of the disease and in- 
creases in successive generations (Tsilfidis et ai, 1992). 
This observation explains the previously puzzling fea- 
ture of the inheritance of this disorder, termed "anticipa- 
tion" — decreasing age of onset and increasing severity of 
the disorder in successive generations within a pedigree 
(Howeler etai, 1989; Dyken and Harper, 1973; Harley et 
ai, 1992b). Fragile X syndrome, the most common de- 
fined form of inherited mental retardation, is also 
caused by expansion of an unstable triplet repeat, in this 
case a CGG repeat within a gene known as FMR-1 (Fu 
ai, 1991; Verkerk et ai, 1991; Kremer et ai, 1991). This 
disorder also has unusual features of inheritance consis- 
tent with anticipation, termed the "Sherman paradox" 
(Caskey et ai, 1992; Sherman et a/., 1985). The degree of 
expansion appears to correlate with the extent of pene- 
trance of the phenotype (Caskey et ai, 1992; Yu et ai, 
1992). Finally, Kennedy disease, or X-linked spinal and 
bulbar muscular atrophy, is caused by an expanding 
GAG repeat, coding for polyglutamine, within the first 
exon of the androgen receptor gene (La Spada et ai, 
1991;Tilley eia/., 1989). 

In addition to the genes identified as causing neuropsy- 
chiatric illness, other genes with triplet repeats are al- 
ready known. In drosophila and mouse, genes with GAG 
or CAA repeats encoding the amino acid glutamine are 
called OPA repeats and are frequently found in homeo- 
domain genes or other DNA binding proteins (Wharton 
et ai, 1985; Duboule et ai, 1987). Pen repeats are GGN 
repeats often coding for long glycine stretches (Haynes 
et ai, 1987). 

As discussed below, other neuropsychiatric diseases 
show features of anticipation, suggesting that they could 
be caused by expansion of triplet repeats in currently 
unknown genes. We therefore sought to determine 
whether additional genes with triplet repeats are ex- 
pressed in the human brain. 

MATERIALS AND METHODS 

cDNA cloning. To identify triplet repeat containing genes ex- 
pressed in brain we screened a cerebral cortex cDNA library in lambda 
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ZAP (StraUgene) with oligonucleotides (CTG),o or (CCG),(, using 
standard techniques (Sambrook et ai, 1989). Oligonucleotides were 
labeled with |7-"*^PIATP and polynucleotide kinase to a specific activ- 
ity of 3000-4000 Ci/mmol. The library was plated at a density of ap- 
proximately 50,000 plaques per 150-mm plate. Baked duplicate filter 
lifts, washed with 2X SSC (0.3 M NaCl, 0.03 M sodium citrate, pH 7) 
for 30 min, and prehybridized with a hybridization buffer containing 
5X SSPE (0.9 M NaCl, 5 mM EDTA, and 50 xnM NaH2P0, * H^O), 5X 
Denhardt*s solution (0.1% Ficoll 400, 0.1% polyvinylpyrrolidone, 
0-1% bovine serum albumin), 10 ;ig/ml of denatured salmon sperm 
DNA and 50% formamide at 42*'C for at least 2 h were hybridized at 
42*'C overnight in the same buffer containing labeled COG or CTG 
oligo probes (1-3 X 10** cpm/ml). Hybridization with COG probe was 
carried out at 48'*C overnight. Wash for the CTG oligo was at GO'^C in 
IX SSC/O.5% SDS (IX SSC= 0.15 M NaCl, 0.015 M sodium citrate, 
pH 7) for 20 min X3; wash for the CGG oligo was at 65 ""C in IX 
SSC/0.5% SDS for 20 min X3. Thirty-eight positives were selected 
and plaque purified (14 from the CGG oligonucleotide screening and 
24 from the CTG oligonucleotide screening). The plasmids (pBlue- 
script SK) were excised using a commercial in vivo excision procedure 
(Stratagene) and the ends of the clones were sequenced using T3 and 
T7 primers. 

Both ends of nucleotide sequence from the clones were used for 
homology search against GenBank (release 71) using the FastA pro- 
gram (GCG package, version 7, on a VAX 8530 computer). The inserts 
of clones with ends containing novel human sequences were then se- 
quenced in their entirety on both strands using an ABI automated 
sequencing apparatus by Roxann Ingersoll in the Johns Hopkins Ge- 
netics Core Facility. Nucleotide sequences of the inserts were ana- 
lyzed for open reading frames and predicted peptide sequences for 
hydrophobicity using the MacVector program with a Macintosh Ilci 
computer. The starting codons in the putative peptides were assigned 
as the first methionine after stop codons in all three reading frames. In 
I he clones with a single long open reading frame the peptide sequence 
was also used for a GenBank homology search using the TFastA pro- 
gram. 

Northern blot analysis. To determine whether or not these genes 
were expressed in human brain, we conducted Northern analysis. To- 
tal RNA was extracted from human frontal cerebral cortex, human 
cerebellar cortex, and total mouse brain, using the cesium chloride 
ultracentrifugation method (Davis et ai, 1986). Total RNA was dena- 
tured by heating at 65°C for 10 min with 50% formamide, separated 
electrophoretically on a denaturing formaldehyde agarose gel, trans- 
ferred to nitrocellulose, and probed with 45 mer oligonucleotide 
probes designed to hybridize to a region of the gene excluding the 
repeat. In cases in which the clone had a poly(A) tail, we made a probe 
only in the antisense orientation. In cases in which the direction was 
not established by a poly(A) tail, we made probes in both the pre- 
sumed sense and the presumed antisense orientation based on open 
reading frame analysis. In all cases, Northern hybridization yielded a 
signal with the antisense probe but not with the sense probe. Blots 
were prehybridized with a buffer containing 4X SSC, 0.01 M Tris- 
HCl, pH 7.5, 2X Denhardt's, 200 ;jg/ml denatured salmon sperm 
DNA, and 40% formamide at 42°C for at least 3 h. Oligonucleotides 
were labeled using l7-''^P)ATP and polynucleotide kinase and hybrid- 
ized overnight with the blots in the same buffer plus 10% dextran 
sulfate and 2-3 X 10^ cpm/ml labeled probes at 42*'C. The specific 
activity of the oHgo probes was approximately 3600 Ci/mmole. Blots 
were washed first at room temperature in 2x SSC/0.1% SDS for 15 
min X3 and then, for higher stringency, at 55 to 60°C in 0.2x SSC/ 
0.1-0.5% SDS for 15 min X3. Exposures at -70°C with an intensify- 
ing screen ranged in length from overnight to 4 days. 

Polymorphism analysis. To detect the presence of polymorphisms 
iri the lengths of the repeats we used PGR analysis of human genomic 
DNA from parents in the (^f^PH families (Dausset et ai, 1990). 
Primers were chosen using the program Oligo (National Bioscience 
Inc.) to yield a product of lOO to 200 bases spanning the repeat (Table 
3). PCR was performed using modified techniques of Sambrook et ai 
{ 1989). Briefly, after an initial denaturation step ol'9G°C for 2 min. the 
cycle was 5.S-fi7T annealing for t min (Tai>le 'U, 72**C exten.sion tor I 



min, and 96°C denaturation for 1 min for a total of 30-32 cycles with a 
final extension step for 10 min. 

The amplification of a specific band was first optimized using 
nonradioactive PCR and then analysis was carried out with one of the 
two primers labeled with (7-^*P)ATP and polynucleotide kinase. The 
reaction volume was 25 ^1, containing 200 ng of template DNA, 0.42 
/iM of each primer, 200 /iA/ DNTPs, 1.5 mAf magnesium chloride, and 
0.5 units of Taq polymerase (Perkin-Elmer Cetus). All experiments 
included a negative control using the same conditions and water as a 
template and a positive control using 10 pg to 10 ng of the cloned 
cDNA as a template. The individuals routinely studied were parents 
from 12 randomly chosen pedigrees of the CEPH families (48 chromo- 
somes). For nonradioactive PCR, the products were electrophoreti- 
cally separated on a 2 to 3% agarose gel. For radioactive PCR experi- 
ments, the products were separated on a 6% denaturing polyacryl- 
amide gel and exposed to film for durations ranging from 3 h to 
overnight. 

Chromosomal localization. For chromosomal localization of the 
clones we used a panel of monochromosomal human-rodent hybrid 
cell lines as templates for PCR. Each cell line contains a single human 
chromosome, A negative control with water as the template and a 
positive control with the cloned plasmid DNA as the template were 
included in each experiment. All reaction conditions were optimized 
such that a product formed when human genomic DNA was used as a 
template but no product (or product of a different size) was formed 
when either hamster or mouse genomic DNA at the same concentra- 
tion served as the template. The PCR conditions were the same as 
described in polymorphism analysis (with 100 to 250 ng of DNA for 
each template). The products were electrophoreticaUy separated on 
an agarose or acrylamide gel as described above. Each chromosomal 
localization experiment was repeated at least twice. 

RESULTS 

Cloning of Novel Human Genes with Triplet Repeats 

To identify genes with triplet repeats expressed in the 
human brain, we screened a human frontal cortex cDNA 
library with oligonucleotide probes containing 10 re- 
peats of either CTG or CGG. There were approximately 
100 positives per 150-mm plate (of about 50,000 plaques) 
in the initial screen for the CTG probe and many more 
positives for the CGG probe. 

To form a more accurate estimate of the number of 
clones containing triplet repeats, filters were lifted from 
lower density plates (5000-6000 plaques). Of these, 
0.28% were positive with the CTG probe and 4.7% were 
positive with the CGG probe. At least four-fifths of the 
latter, however, encoded 28S ribosomal RNA, as demon- 
strated by hybridization with a 28S ribosomal RNA oli- 
gonucleotide pool probe. 

Of our initial screen, 40 positives were selected for 
plaque purification. Two did not show a strong signal on 
rescreen and were discarded. The sequences of the re- 
maining clones indicated that 12 coded for 28S ribo- 
somal RNA, which is known to have CGG repeats (Gon- 
zalez et ai, 1985). Nine were previously known genes or 
were human homologues of known mammalian genes 
(Table 1) including human apoferritin, sodium potas- 
sium ATPase, and calcium-dependent protease (Cos- 
tanzo et ai, 1984; Lane et ai, 1989; Ohno et ai. 1986). 
T*he first clone sequenced, CTG-A2, was myotonin pro- 
tein kinase, with 13 CTG repeats. Three of the clones 
could not be readily sequenced. Fourteen were novel hu- 



HUMAN BRAIN GENES WITH TRIPLET REPEATS 



575 



TABLE 2 

Character ization of the Clones 

mRNA kb Chromosomal Number of Percentage of 
Clones (exposure) localization alelles heterozygosity 



CCG-A3 

CTG-A4 

CTG-Bl 

CCG-B7 

CTG-B33 

CTG-B37 

CTG-B43 

CTG-B46d 



4.5 (2 days) 
5.0 (3 days) 
5.5 (4 days) 

2.3 (2 days) 
1.2 (2 days) 

2.4 {3 days) 
2.0 (4 days) 
9.0 (3 days) 



6,6 
1 

X 

3, (12) 

12 

16 

10. (20) 



1 
2 
2 

ND 
7 

13 
1 
1 



0 



ND 
70 
85 
0 
0 



Note. ND, not done. 

Chromosomal Localization 

For the chromosomal assignment of these genes, we 
used DNA from rodent-human hybrid cell lines con- 
taining a single human chromosome (Warburton et ai, 
1990; Theune et ai, 1991) as PGR templates (see Table 2 
and Fig. 3). Each clone was assigned to a different auto- 
some, except for the assignment of CCG-B7 to the X 
chromosome. Three genes yielded products with DNA 
from two chromosomes by several different primer pairs, 
suggesting the existence of pseudogenes or closely re- 
lated genes (on chromosomes 5 and 6 for CCG-A3, on 
chromosomes 3 and 12 for CTG-B33, and on chromo- 
somes 10 and 20 for CTG-B45d). Radioactive PGR with 
DNA templates from monochromosomal rodent-human 
hybrid cell lines showed that the products of chromo- 
somes 3 and 10 are the same size as of the products of 
clones GTG-B33 and GTG-B45d, respectively. The prod- 
ucts of chromosomes 12 and 20, respectively, were of 
different size and thus are shown in parentheses in Ta- 
ble 2. The products of both chromosome 5 and chromo- 
some 6 showed the same size as clone GCG-A3 and thus 
both are shown without parentheses (Table 2). 



Polymorphisms in Lengths of the Triplet Repeats 

To analyze the lengths of the triplet repeats in a sam- 
phng (48 chromosomes) of the normal human popula- 
tion, we used PGR primers spanning the repeats with 
templates from a selection of parents in the GEPH pedi- 
grees (Table 2 and Fig. 4). Two of the repeats were very 
highly polymorphic. 

CTG-B37 had 13 different alleles with 85% observed 
heterozygosity in an analysis of 24 GEPH parents (Fig. 
4a). The copy numbers of the GAG repeat ranged from 9 
to 23. 

Clone GTG-B33 had 7 alleles on chromosome 3; about 
70% of individuals were heterozygous. In addition, the 
cross-reacting band on chromosome 12 also revealed he- 
terozygosity, with 3 or 4 alleles and 20% heterozygosity 
(Fig. 4b). 

Glone GTG-Bl showed about 50% heterozygosity 
with 2 alleles (not shown). Clone GTG-A4 showed only 
slight heterozygosity (20%) with 2 alleles and clones 
CCG-A3, GTG-B45d and GTG-B43a were not polymor- 
phic (not shown). 

To confirm the inheritance of the different alleles of 
clone GTG-B37. we conducted PGR analysis in the 
GEPH family No. 1347 (Fig. 5). The alleles were in- 
herited in a Mendelian fashion through all three genera- 
tions. Preliminary hnkage analysis is consistent with 
the assignment to chromosome 12. 



DISCUSSION 

In this study we have identified 14 novel human genes 
containing triplet repeats and analyzed 8 of them. We 
find that these genes are widely dispersed among the 
human chromosomes. They are expressed as scarce to 
moderately abundant messages in human cerebral and 
cerebellar cortices. In several cases the lengths of the 
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Clone 



CCG-A3 

CTG-A4 

CTG-Bl 

CCG-B7 

CTG-B33 

CTG-B37 

CTG-B43 

CTG-B45d 



TABLE 3 
Primers for PGR 



Primers 



Size of 
products (bp) 



Annealing 
temperature (**C) 



5'(296)TCGAGACCAGAACGGCCTTTCCAA 

5'(547)TATTACTGCGACCGCCCCAGCTAC 

5'(302)CTGATCACTTGTGGTTCTGCGCCG 

5'(485)AGGACTCCCCCATGCTCGCCAG 

5'(270)AGTTCCGCCAGCCCTTGAGAG 

5'(381)GGGAGCTGCCTGCCTCTTGAT 

5'(153)CTGGCTGTTGGAGTCTGGGGC 

5'(316)GCATCCATGGGCTACCACGAC 

5'(425)CAAAAAAGCACCTGGTACTAA 

5'(541)GGGCTGGAGCCTTTTTACTCGC 

5'(699)CACCAGTCTCAACACATCACCATC 

5'(840)CCTCCAGTGGGTGGGGAAATGCTC 

6'(865)CCCACCATCTTCCCGCTGC 

5'(980)GCAGCTGGGCAGTCTGGGC 

5'(463)CTCCAGGAAGAGGAAGAAAGC 

5'(619)CTGTGGCTCAACCCTCATTTC 




250 

111 
163 
116 
141 
115 
156 



59 



62 
63 
59 
62 
55 
65 



P""^ers for chromosomal assignment and length polymorphism. Primers for clones CTG-R? nnH PTP a 



repeats are highly polymorphic, making them valuable 
as PCR typeable linkage markers. 

Previous studies have found that triplet repeats are 
enriched relative to expected rates in genomic DNA 
AAC, CCG (=:=CGG), AGC (=CTG), and AAT are the 
most common motifs found in sequence data bases 
(Tautz and Renz, 1984; Beckmann and Weber, 1992), 
One previous study has used oligonucleotide probes to 
identify several new GAG repeats in genomic DNA; how- 
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ever, no mRNA transcripts were found in various hu- 
man tissues (Zischler et al, 1992). Another group has 
used PCR to identify simple sequence repeats in human 
genomic DNA (Edwards et al, 1991). Like minisatel- 
lites, triplet repeats have been found to be widely dis- 
persed in human genomic DNA (Zischler et al, 1992; 
Edwards et al, 1991; Tautz, 1989; Wong et al, 1987). We 
.now find that genes containing triplet repeats are also 
present on many different chromosomes. 
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20. hut radioactive PCR indicates that the alMerr ™n ^ 7^^ f '° chromosome 10. (There is also a band at chromosome 

arc the same as in (a) " corresponding to the clone are on chromosome .0 as shown in Fig 4 for CTC.833.) The lanes 
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FIG. 4. Repeat lengths of two triplet repeats are highly polymor- 
phic. Radioactive PGR was performed using the primers spanning the 
repeats in CTG-B33 or CTG-B37 with DNA from parents in the 
CEPH pedigrees (Nos. 1413, 1416, 1362. 1421, 66, 1418, 12, 884, 13291, 
1350. 1331. 1332). Clone CTG-B37 has 13 alleles and is highly poly- 
morphic (a). For clone CTG-B33 (b) the highly polymorphic allele is 
on chromosome 3; lane 3 is the amplification of chromosome 3 hu- 
man-rodent hybrid DNA; lane 12 is the amplification of chromosome 
12 human-rodent hybrid DNA; and lane C is the amplification of 
clone CTG-B33 plasmid with radiolabeled PGR primers. CTG-B33 
has approximately 7 alleles. 



A recently published paper presents data consistent 
with those of the current study (Riggins et al, 1992). The 
lengths of the triplet repeats of several genes in the data- 
base were found to be highly polymorphic. In addition, 
Riggins et al (1992) screened a number of cDNA librar- 
ies with (CAG)io and (CGG),o, finding a similar abun- 
dance of positives in fetal brain as we find in adult brain, 
but finding 5- to 10-fold fewer positives in libraries from 
various cell lines and muscle and testis. The lengths of 
the repeats of these clones were shorter, and only 
slightly polymorphic, perhaps because the screening was 
done at a lower stringency than in the current study. 

The present study establishes that the novel genes 
with triplet repeats are expressed as mRNAs in human 
brain. Based on the results of screening reported here we 
estimate that on the order of 50-100 novel genes con- 
taining CTG or GAG repeats are expressed in human 
brain (assuming that 50,000-100,000 genes are ex- 
pressed in brain, that the representation of the triplet 
repeat-containing genes parallels that of all genes in the 
library, and that about one-sixth to one-third of posi- 
tives are independent genes). The number of CCG or 
CGG repeats is harder to estimate, since so many of our 
positives were 28S ribosomal RNA, but could be greater. 

In 7 of the 8 clones in which we sequenced the full 
insert, the triplet repeat was contained within an open 
reading frame. Thus, it is likely that the repeats are 
translated into amino acids, including glutamine, gly- 
cine, proline, and serine (Fig 1). Hydrophobicity analy- 
sis suggests that all except one (CCG-R7) oi the proteins 
are soluble. This is in contrast to the repeats of Riggins 
et al. (1992), which were mostly in the 5' untranslated 
region. This could be l)ecause most of our novel se- 
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glutamine repeats, important for transcription (Kadon- 
aga et aL, 1988; Courey and Tijan, 1988). Thus some of 

rlonpfi with crliit^mino T-^ar^^rtj-^ f 



— , — iijau, j.;7oo;. 1 nus some oi 

our clones with glutamine repeats may code for tran- 
scription factors. Other repeats may be involved in pro- 
tein-protein interaction. For instance an 11-proline re- 
peat in human calcineurin A has been proposed to be 
involved in binding to calmodulin (Guerini and Klee 
1989). * 

Disorders other than those described in the introduc- 
tion may have patterns of inheritance consistent with 
anticipation. In Huntington's disease, offspring of af- 
fected males have younger ages of onset than their par- 
ents (Ridley et aL, 1988). The presence of individuals 
with schizophrenia spectrum" disorders in families of 
probands with schizophrenia (Kendler et al , 1985) sug- 
gests variable penetrance, and the risk of developing 
schizophrenia for children of probands with schizophre 

^'^'^ ^^^"^ P^^«"ts (Gottesman. 
1991). Our recent study of pedigrees with unilineal 
transmission of bipolar affective illness suggests that 
disease severity increases and age of onset decreases in 

, ?'?Jf«T'^/'"°'" generation to the next (Depaulo 
et ai, 1992; Mclnnis et aL, 1992). 

Our study shows that screening cDNA libraries with 
oligonucleotide probes can be used to identify novel 
genes with triplet repeats. Some of these genes may code 
for transcriptional regulators or other nucleotide bind- 
ing proteins Based on our experience many are likely to 
be highly polymorphic, making these genes valuable link- 
age markers for studies of the human genome. Finally as 
we suggest above, additional neuropsychiatric disorders 
appear to have features of anticipation consistent with 
causation by expansion of triplet repeats. Therefore we 
suggest that novel human genes with triplet repeats may 
be candidate genes for neuropsychiatric disorders 
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Genes with triplet repeats: candidate mediators of 
neuropsycliiatric disorders 

Christopher A. Ross, Melvin G. Mclnnis, Russell L. Margolis and Shi-Hua Li 



Recently a new form of human mutation - expansion of trinucleotide 
repeats - has been found to cause the diseases of fragile X syndrome, 
spinal and bulbar muscular atrophy, myotonic dystrophy and, most 
recently, Huntington's disease. We review the emerging data on the 
genetics and neurobiology of these disorders. Three are characterized 
by unusual patterns of inheritance, in particular, genetic 'anticipation ', 
in which the severity of the disorder increases and the age of onset 
decreases in successive generations of a pedigree. Several idiopathic 
neuropsychiatric disorders have features of inheritance consistent 
with anticipation. In bipolar affective disorder, there is evidence for 
both earlier age of onset and more severe illness in the second 
generation of a subset of unilineal pedigrees. There is also the 
suggestion of anticipation in some forms of schizophrenia, spino- 
cerebellar atrophy and autism. Triplet repeats are present in additional 
known genes, both in coding regions and untranslated regions. 
Furthermore, many novel genes with triplet repeats are expressed in 
the human brain, and these are candidates to cause some forms of 
these neuropsychiatric disorders. 

Trinucleotide (or triplet) repeats are repeated units 
of three nucleotides present in genomic DNA. When 
they occur in the coding regions of genes, they can 
direct the synthesis of repeating single amino acids. 
The most common triplet repeats in human DNA 
included in the Genbank database are CGG, GAG, 
CAA, TAA and GAG (Ref. 1). (Note that in double- 
stranded DNA, any triplet could be read in six 
possible reading frames; GAG is equivalent to AGC, 
GCA, CTG, TGC or GCT.) Since the lengths of these 
repeats (like those of other microsatellites, or short 
repeating sequences of DNA) at any given locus are 
often polymorphic - variable from individual to 
individual - they are useful as linkage markers in the 
Human Genome Project Otherwise, they appeared 
to be curiosities, until four neuropsychiatric dis- 
orders were recently discovered to be caused by 
expansion of unstable trinucleotide repeats (recently 
reviewed by Davies^, Richards and Sutherland^ and 
Caskey et a/."*). 

Myotonic dystrophy 

Myotonic dystrophy is an autosomal dominant 
disorder characterized primarily by myotonia and 
progressive muscle weakness, and is the most 
common form of adult-onset muscular dystrophy^. 
There may be cognitive changes, including mental 
retardation, as well as skeletal, cardiovascular and 
ocular manifestations. The disorder shows 'antici- 
pation', with severity increasing over three or four 
generations, presenting as cataracts or mild alter- 
ation in muscle contractility in late adulthood of the 
first generation, moderate weakness in an inter- 
mediate generation, and progressing to a more 
severe disease state including neonatal onset with 
mental retardation in a later generation^. These 
aspects of inheritance were so unusual that they 
were attributed to artifacts of ascertainment bias 
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(probands coming to medical attention tend to 
present a 'late-generation' form of the disorder and 
so are more severely affected than other members 
of their family) or other artifacts^, and only recently 
have been clinically confirmed as anticipation^. 

The gene that causes myotonic dystrophy, called 
DM-1 or myotonin protein kinase, has recently been 
identified using positional cloning and, in one case, 
direct screening for triplet repeats; this gene con- 
tains a CTG repeat In Its 3' untranslated region (see 
Fig. 1)^''\ The length of the CTG repeat sequence 
Is highly polymorphic in the normal population, with 
a range of 5 to 30 repeats. Patients have expanded 
repeats, often with many hundreds of repeats, and 
the extent of expansion correlates with severity of 
the disease''^'''^. The gene has regions of strong 
homology to cAMP-dependent protein kinase''^''''' 
and is expressed in brain, heart and muscle''^. 

Fragile X syndrome 

Fragile X syndrome is the second most common 
genetic form of mental retardation (after Down 
syndrome), occurring In about one of 2000 births. It 
was originally identified by the association of a 
folate-sensitive fragile site on the distal end of the 
long arm of the X chromosome (band Xq27.3) with 
mental retardation. Hemizygous males have the 
fragile X locus on their single X chromosome and 
generally have moderate to severe retardation, 
often with pervasive developmental delays and 
autistic behavior'*''"'*. Physical signs may include 
alterations such as testicular enlargement, anatom- 
ical defects and connective tissue abnormalities. 

The pattern of inheritance in families with the 
fragile X marker shows a form of anticipation, with 
increasing penetrance in succeeding generations. 
Passage through a female increases the risk in the 
next generation for children to be affected. These 
two characteristics comprise the so-called 'Sherman 
paradox' - the first-degree male relatives of trans- 
mitting males have less chance of being affected than 
male grandchildren of such males^'^'*'''^. Obligate 
female carriers (some of whom presumably have the 
full mutation) of the fragile X chromosome are not un- 
affected, but may only have mild cognitive defects and 
psychiatric features such as schizotypal symptoms'"^. 

The genetic basis of the fragile X syndrome is the 
expansion of a triplet repeat in a gene called FMR-1 
(see Fig. ^)'^•^'^~^\ The full-length sequence for the 
transcribed portion of the gene has not yet been 
published (it is a 4.8 kb mRNA). Within what is 
believed to be the 5' untranslated portion of the 
message is a region of multiple repeats of CGG. In 
the normal population, the length of the repeat is 
highly polymorphic, ranging between 6 and about 
42 triplets. In pedigrees with the fragile X syndrome, 
the CGG repeat can become unstable and then its 
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length expands during transmission from one gener- 
ation to the next. Unaffected members of the 
family, like members of the general population, 
have less than 50 repeats, or may carry a so-called 
•premutation' with 50-200 repeats. The premu- 
tation is unstable, and can expand to an extent 
sufficient to cause disease in subsequent gener- 
ations. Affected members may have expansions of 
up to several thousand bases. The size of the 
premutation in women correlates with their likeli- 
hood of bearing affected children. 

In the normal population, mRNA of the FMR-1 
gene is expressed at the highest levels in the brain 
and the testes, two of the major organs affected in 
the fragile X syndrome. Expression in the brain 
appears primarily neuronal and generally parallels 
neuronal density, with high levels found in the 
cerebellar granule cell layer, hippocampus and some 
or all neurons in cerebral cortex^^. In addition, 
relatively high levels of expression are present in 
ovarian follicle and the eye, and the mRNA is also 
widely expressed in early embryonic development. 
The function of the FMR-1 gene is unknown. It has 
no homology to other known sequences, except in 
the area of the repeats itself. The predicted protein 
structure does contain a region coding for a sequence 
of amino acids (KKXK) that might cause nuclear 
translocation of the protein, suggesting the gene 
could code for a DNA-binding protein or some other 
nuclear regulatory protein. 

Spinal and bulbar muscular atrophy 

X-Linked spinal and bulbar muscular atrophy (also 
called Kennedy's disease) is a rare, late-onset form 
of motor neuron degeneration^^ that may be 
accompanied by mental retardation and partial 
insensitivity to androgens. Because the human 
androgen receptor gene is located on the X chromo- 
some, it seemed possible that it represented the site 
of mutation and so was studied in patients with this 
disorder. An expanded CAG repeat was found; 
patients had numbers of repeats ranging from 40 to 
52 while controls had a range of only 17 to 26 (Ref. 
24). There was no overlap in the number of triplet 
repeats between the patients and control samples, 
and no alteration in any other region of the 
sequence of the androgen receptor was found, 
strongly suggesting that expansion of the triplet 
repeat accounts for the disease. 

Like other steroid receptors, the androgen recep- 
tor is a DNA-binding protein involved in regulation 
of transcription. The CAG repeat is in the first exon, 
outside of the region that encodes either DNA- or 
hormone-binding activity^"*. In the reading frame of 
the androgen receptor, CAG repeats are presumably 
translated into a polyglutamine tract. The phenom- 
enon of anticipation has not been clearly observed 
in spinal and bulbar muscular atrophy, but the 
degree of expansion does correlate with severity of 
the illness25.26^ 
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Fig. 1. Diseases caused by expansion of trinucleotide repeats. On the left, a 
schematic diagram of each gene, with the repeat region depicted as an 
inverted pyramid and the numbers of repeats beside it Open regions represent 
the normal variation of repeat number; blue regions represent the range of 
abnormally long repeat sequences that are unstable but not causative of 
prominent clinical symptoms (premutations): and red regions represent 
expanded repeat lengths that cause disease. A representative pedigree for 
each disorder is depicted to the right Open, blue and red symbols represent 
unaffected individuals, individuals with premutations, and affected individuals, 
respectively. The number of repeats in each allele is displayed below the 
symbol. Circles represent females, squares represent males, diamonds or 
triangles represent individuals whose sex is not identified in order to preserve 
confidentiality, with a number inside to indicate more than one person. A dot 
inside a circle indicates a earner. (Gene diagrams are adapted from Ref. 3, and 
pedigrees are from Refs 4, 25 and 31.) 



Huntington's disease movement disorder, emotional disorder and de- 

Huntington's disease is an autosomal dominant mentia^^. Affected children of affected fathers 
syndrome linked to chromosome 4p, involving have an age of onset eight to ten years earlier than 
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their fathers, while affected children of affected 
mothers have an onset age similar to their mothers. 
This phenomenon appears to continue through the 
generations, with some children of affected fathers 
having juvenile onset^^'^^. These features led us to 
suggest^° that Huntington's disease might be 
caused by expansion of a triplet repeat in a gene on 
chromosome 4. 

Very recently, the Huntington's disease Collabor- 
ative Research Group has used exon trapping (a 
technique for extracting exons from genomic DNA 
cloned in vectors such as cosmids) to identify a gene 
from chromosome 4p, termed IT15 ('interesting 
transcript 15'), containing a repeating CAG triplet^''-^^. 
Normals have between 11 and 34 repeats with a 
median of 19, while patients with Huntington's 
disease have between 37 and 86 copies with a 
median of 45. The CAG repeats appear to be in the 
5' portion of the coding region, which yields a 
protein with a deduced molecular weight of about 
348 kDa. In the predicted reading frame, the CAGs 
code for polyglutamine, like in the androgen recep- 
tor gene. The gene appears to be widely expressed 
in brain and peripheral tissues as an 11 kb transcript. 
Patients with longer repeats have an earlier age of 
onset with a high correlation (r = 0.75) between 
length of repeating region and age of onset^'''^^. 
However, for most Huntington's disease alleles 
(which have between 37 and 52 repeats) there is 
such a wide range of ages of onset that the length of 
the repeat cannot be used to predict the age of 
onset for individual patients. The length of the 
Huntington's disease allele is highly unstable in 
vertical transmission, with a difference between the 
two sexes. During female transmission, the length of 
the Huntington's disease allele frequently changes, 
but only by a few repeats, so that the mean length 
of the allele in the child is very similar to that in the 
mother. However, in paternal transmission, the 
repeat lengthens dramatically in about one third of 
the cases to about twice that of the paternal allele. 
This appears to occur during spermatogenesis^"". 

Pathophysiology 

These diseases represent a new class of disorders 
caused by alterations in unstable DNA sequences. 
Thus, rather than having fixed mutations these 
diseases are caused by so-called 'dynamic' mu- 
tations^ that can change in each generation, 
accounting for the unusual aspects of inheritance, 
especially anticipation. The mechanism by which the 
triplet repeats expand is unknown. Unequal crossing 
over during recombination could not account for the 
large expansions from hundreds to thousands of 
bases that are observed. There is an enzyme 
(telomerase) that inserts multiple copies of a six- 
base-repeat into telomeric DNA (Ref. 32), but no 
enzymes that could insert a three-base-repeat into 
genomic DNA are currently known. It has been 
proposed that slippage of a DNA polymerase during 
DNA replication may be involved^^. 

In myotonic dystrophy and fragile X syndrome 
there appear to be two separate mutational events. 
In the normal population, the length of the triplet 



repeat is polymorphic, but stably inherited. The first 
event is the formation of a 'premutation' or allele 
with a slightly longer repeat, which appears to be a 
rare event since founder chromosomes (haplotype 
evidence of a single event on an ancestral chromo- 
some) have been suggested for both disorders^'^*^^. 
Population genetic modeling of the fragile X syn- 
drome suggests that the premutation persists in the 
population for a mean of 60 generations^^'-^^. The 
premutation renders the DNA sequence unstable 
such that it can then expand to a much greater 
length, and cause disease. Once the expansion has 
taken place, it becomes increasingly unstable - the 
longer the expansion, the more it can expand in 
subsequent generations, with the cycle presumably 
ending when the disease becomes so severe as to 
prevent reproduction. Rare contractions of an ex- 
panded allele back to normal length (along with 
normal phenotype) have been described for myo- 
tonic dystrophy^^'^^; the mechanism may involve a 
gene conversion evenf**^. 

In both myotonic dystrophy and fragile X syn- 
drome, the repeat is more likely to expand during a 
female meiosis. In fragile X syndrome there appears 
to be a relationship between methylation of a CpG 
island in the region 5' to the gene and expansion of 
the repeat. Which comes first is currently un- 
certain'*'', though indirect evidence suggests that the 
expansion is the primary event^'*-'*^. In Kennedy's 
disease, by contrast, the expansion behaves more 
like a premutation seen in the other disorders; large 
expansions do not occur, but there is mild instability 
of the length of the repeats, which can yield both 
small expansions and small contractions^^-^^ \^ this 
disease the repeat length appears more unstable in 
male meiosis. 

It is uncertain how the expanded triplet repeat 
causes these diseases. Large expansions may make it 
impossible for the message to be transcribed or 
translated, possibly related to the hypermethylation 
of the CpG island. Pieretti et al."^"^ have found that 
the FMR-1 mRNA is absent in the majority of male 
fragile X patients. In the case of fragile X syndrome, 
simple absence of the message may cause symp- 
toms, since other mutations in the same gene, such 
as deletions'*^ or a point mutation'^^ appear to 
cause a phenotype similar to fragile X without 
expansion of the CGG repeat, presence of the 
folate-sensitive fragile site or methylation of the 
CpG island. 

To our knowledge, no pathological studies of the 
brains of fragile X patients have yet been reported. 
A study using magnetic resonance imaging (MRI) 
found selective atrophy of the cerebellar vermis in 
fragile X patients compared to either normals or 
patients with other developmental disabilities'*^. 
How this relatively selective effect results from a 
mutation in a gene expressed widely in the brain^^ is 
unclear. 

In the case of myotonic dystrophy, mRNA levels 
for the expanded allele are decreased, dramatically 
in congenital cases, while levels of the mRNA 
corresponding to the normal allele appear un- 
changed''®''*^. Overall protein levels are modestly 
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'decreased. How these changes cause the domi- 
nance of the gene is unclear, but it may be a dosage 
effect"*^. Normal expression of the DM-1 gene is 
just beginning to be defined (Schalling, G,, pers. 
commun.). Like the FMR-1 message, it is widely 
expressed in brain, but apparently with more 
regional selectivity; for instance, it appears to be 
relatively highly expressed in cerebellar Purkinje 
cells. Several splice isoforms of the DM-1 mRNA 
have been described, but their biological signifi- 
cance is as yet unknown'*^'^^. The means by which 
expansion of repeats in the 3' untranslated region 
yields decreased levels of message is not clear, but 
this region in at least some genes may have an 
influence on regulation of mRNA (Ref. 51). 

In the case of the gene for the androgen receptor, 
which has the repeat in a coding region, the effect 
of the expanded triplet repeat on protein function 
has been studied^^, in cell culture, with normal or 
expanded androgen receptor cDNA and a reporter 
gene coupled to the androgen receptor response 
element. Although androgen binding was normal, 
there was decreased transactivation of the hormone 
response element. Deletion of the region with the 
polyglutamine tract does not affect receptor func- 
tion as much as the expansion. Mutations in other 
parts of the molecule, especially the ligand-binding 
region, by contrast, cause a different disorder - the 
androgen insensitivity syndrome^^'^"^. The N-ter- 
minal region of the androgen receptor has recently 
been implicated in binding to other cell-specific 
enhancer factors^^. Expansion of the triplet repeat 
could prevent the androgen receptor from interacting 
with some other enhancer factor that has a restricted 
cellular expression pattern, which might, in part, 
explain the selective neuronal vulnerability observed 
in the disorder. 

As yet, little is known about the mechanism by 
which the expanded triplet repeat in IT15 causes 
Huntington's disease, a fully penetrant dominant dis- 
order. It may be that the greater prevalence of antici- 
pation in male transmission relates to some form of 
imprinting. It is uncertain how a mutation in a gene 
coding for a protein expressed widely in the brain 
and body causes neuronal degeneration preferen- 
tially in the basal ganglia and cerebral cortex. 
Preliminary data®^ suggest that the IT15 message is 
expressed in neurons and to a lesser extent in glia 
throughout the brain. Expression in the brain is 
higher than that in the periphery, but does occur 
widely in many tissues in the body. Furthermore, 
expression does not appear to be altered dramati- 
cally in either the caudate nucleus or cortex of 
Huntington's disease patients compared with con- 
trols. Thus the pathophysiology of the disorder 
might involve an action of an abnormal IT15 protein 
product. The protein encoded by IT15 has no 
prominent hydrophobic domains, and thus may be 
soluble. It might be a structural protein, or a protein 
that interacts in a non-covalent manner with a 
structural protein. It has one leucine zipper (of four 
leucines) and so could be a transcription factor of 
the Fos/Jun family, though it would be much larger 
than any other members of the family^^. If it were a 
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^Genbank (release 71.0) was searched with a 30-base string of ten CTG repeats (CAG 
equivalent to AGC, GCA, CTG, TGC or GCT). All genes shown have no more than two 
mismatches and a perfect match of at least six repeats. (The number of matches is far 
greater than would be expected by chance: a search with an arbitrary 30-base-long 
string yields no better than 1 3 mismatches.) 

^'The sequence in the database at the time the search was done had five repeats, but the 
most common allele has 13 repeats. 

* Genes which are known to undergo triplet repeat expansion, causing disease. 

transcription factor, it is conceivable that, in a 
manner similar to the androgen receptor, expansion 
of the polyglutamine repeat could alter its inter- 
action with a second, cell-type-specific factor, 
causing a gain-of-function phenotype with a 
restricted anatomical distribution corresponding to 
that of the second factor. Similarly, if it were a 
structural protein, an interaction with other struc- 
tural proteins might be disrupted. An alternative 
hypothesis suggests that Huntington's disease 
might be related to a mitochondrial dysfunction, 
perhaps inducing excitotoxicity^''. 

Other disorders with anticipation 

The discovery in rapid succession of four diseases 
caused by expansion of triplet repeats raises the 
possibility that more remain to be found. Several 
additional neuropsychiatric disorders with clinical 
features of anticipation could be candidates. 

Perhaps the most striking possibility is bipolar 
affective illness, which is a much more common 
disease than any of the previously discussed 
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TABLE II. Genes containing GAG repeats^ 
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disorders, with a lifetime prevalence of about 
0.5-1% of the population. In bipolar affective 
illness, there appears to be genetic transmission with 
an as yet undetermined pattern of inheritance in 
many families^^. We have studied pedigrees with 
unilineal transmission of bipolar affective illness^^ 
finding marked anticipation, with age of onset 
about 11 years earlier and episode frequency about 
twice as great in the second generation compared 
with the first^°. A Cox proportionate hazard analysis 
showed that the anticipation was separable from the 
previously known changing rate of major depression 
during this century or 'cohort effect'^^ In addition, 
changing patterns of drug abuse, deaths of affected 
persons prior to interview, decreased fertility and 
censoring of age of onset did not account for the 
results. Anticipation appears to be present in about 
40% of the pedigrees. 

Schizophrenia also runs in families, with an 
uncertain pattern of inheritance. The presence of 
individuals with 'schizophrenia spectrum' disorders 
in families of probands with schizophrenia^^ 
suggests variable penetrance. The risk to parents of 
probands is 6%, to siblings 9% and to children of 
probands 12%, combining data taken from a 
number of European studies of the illness using 
consistent diagnostic criteria^^. This pattern could 
have other interpretations, but is consistent with 
anticipation. Review of age of onset data from a 
study of pedigrees*^ with more than one affected 
case is also suggestive of anticipation. In about 15 
families, there was anticipation of 10 years or 
greater in transmitting pairs (Pulver, A., pers. 
commun.). 



Autosomal dominant hereditary ataxia also dis- 
plays variable age of onset, with some evidence for 
anticipation. This heterogeneous group of disorders, 
also referred to as spinocerebellar ataxia or her- 
editary olivo-ponto-cerebellar atrophy, is charac- 
terized by cerebellar ataxia, variably accompanied 
by dementia and extrapyramidal signs. In 1950, 
Schut^^ explicitly pointed out the presence of 
anticipation in a large pedigree, noting both a 
younger age of onset and a more severe course 
in later generations. Although no study specially 
designed to exclude ascertainment bias has been 
performed, data from a number of other pedigrees 
also suggest the presence of anticipation in at least 
some forms of hereditary ataxia^^^^. 

Finally, the distribution of illness in families of 
patients with autism not associated with fragile X 
syndrome could also be consistent with anticipation 
with a dominant pattern of inheritance. About half 
of parents of autistic probands have mild neuro- 
psychiatric disabilities that are characterized by 
social language disorders and aloofness^^. 

Functions of genes with triplet repeats 

In addition to the four genes causing diseases 
described above, other genes with triplet repeats are 
already known. There are about a dozen human 
genes with CGG repeats already entered into 
Cenbank^'^° In several cases the length of the 
repeat is polymorphic in the normal population. In 
most cases, CGG repeats appear to be in the 5' 
untranslated regions of the transcripts^. Unlike the 
CGG repeats, CAG repeats are predominantly 
within coding regions, and glutamine is the most 
common amino acid encoded (Table I). For GAG 
repeats in genes, eight out of ten are in coding 
regions, with glutamate being the most common 
amino acid (Table II). Another common triplet 
repeat, TAA, has been reported in only two genes, 
both in untranslated regions (not shown). 

The functions, if any, of repeated single amino 
acids in a protein are uncertain. Different amino acid 
repeats may have different functions. Repeated 
amino acids, especially glutamine, are present in a 
number of transcription factors, most likely in non- 
DNA-binding regions. Proteins containing so-called 
Opa repeats, which are polyglutamine tracts, in 
Drosophila and mouse^^'^^ often code for develop- 
mental-stage-specific transcription factors including 
homeodomain proteins. Human TATA-binding 
protein has 38 consecutive glutamine residues in its 
N-terminal domain^^ Transcription factor Spl has a 
glutamine-rich region with shorter glutamine re- 
peats, which are important for transcription^"^'^^. 
Repeat sequences of GGN, known as Pen, isolated 
in Drosophila, encode polyglycine and are present in 
a homologue of the rat helix-destabilizing protein^^. 
A proline-rich transcriptional activation zone has 
also been described^^. Regions rich in acidic amino 
acids are activational domains in a number of 
transcription factors^^. Inducible transcription fac- 
tors would be interesting candidate genes for brain 
disorders, because of their involvement in develop- 
ment and plasticity^^^\ 
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other repeats may be involved in different 
protein-protein interactions. For instance, a repeat 
of 11 proline residues in human calcineurin A has 
been proposed to mediate binding to calmodulin®^. 
In some cases, repeats of the hydrophobic amino 
acid leucine are present in signal sequences, where 
the exact amino acid sequence is believed to be less 
important than the overall hydrophobicity. 
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Additional novel genes with triplet repeats 

Knowing that triplet-containing genes are present 
in the human genome®^, and positing the possibility 
that other diseases could be caused by expansion of 
triplet repeats, the question arises: are there addi- 
tional genes with triplet repeats yet to be discovered? 
A number of groups have begun the search for such 
genes^^'^O'^"*-^^. One approach involves direct 
screening of patients' genomic DNA for expanded 
repeats using oligonucleotides and a ligase cycle 
reaction, termed 'repeat expansion detection'®''. 
This method can detect the expansions in myotonic 
dystrophy and fragile X DNA, and has suggested the 
presence of an expanded CTG repeat on chromo- 
some 18 in three families of the Centre d'Etude du 
Polymorphisme Humaine (CEPH). 

Another approach is to screen cDNA libraries to 
identify individual candidate clones. Riggins et al7^ 
recently identified a number of genes with triplet 
repeats by this method. Because they screened at 
relatively low stringency, the repeats identified were 
short (all but one had five or fewer repeats) and 
therefore were only slightly polymorphic in the 
normal human population. Since the four known 
diseases caused by expansion of triplet repeats all 
have repeat lengths that are highly polymorphic in 
the normal population, one would like a screening 
process that can identify repeats long enough to 
have such polymorphisms. 

By screening human brain cDNA libraries at high 
stringency we have been able to identify a series of 
novel genes with triplet repeats^°. Fourteen out of 
an initial group of forty clones encoded novel 
human genes, and eight of the inserts have been 
fully sequenced. All contain repeats and five of the 
eight have nine or more perfect repeats (see Fig. 2). 
The first clone sequenced was the myotonic dys- 
trophy gene with thirteen CTG repeats. In five 
clones with CAG repeats, the repeats were within 
long open reading frames, suggesting that they may 
be translated into amino acids. In four of these cases 
the amino acids would be glutamine. 

Northern blot analyses show that all of the genes 
are expressed in brain, with the suggestion of 
differential distribution (some were expressed more 
highly in cerebral cortex than cerebellar cortex, 
while others showed the reverse). Chromosomal 
assignments reveal a distribution on several auto- 
somes and one on the X chromosome. Furthermore, 
the repeat lengths of four out of the six CAG triplets 
were polymorphic, with two very highly poly- 
morphic, one with 85% heterozygosity and thirteen 
different alleles. 

In conclusion, expansion of triplet repeats consti- 
tutes a new form of dynamic mutation capable of 
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Fig. 2. Maps of the inserts of nine novel clones containing triplet repeats, 
obtained by screening a human brain cDNA library with CTG or CCG repeat 
oligonucleotides^^. Open reading frames are indicated by thicker lines. 
Nucleotide repeats are shown below, and predicted amino acids above the 
lines. Note that most of the repeats appear to be in open reading frames. The 
first clone sequenced (A2) was DM-1 of myotonic dystrophy, which was a 
novel sequence at the time. 



causing diseases with unusual forms of inheritance, 
particularly anticipation. A number of neuro- 
psychiatric disorders, especially bipolar affective 
disorder, show features suggestive of anticipation. 
Furthermore, there appear to be many genes with 
triplet repeats expressed in the human brain. Some 
of the repeats are relatively long and polymorphic in 
the normal population. We therefore suggest that 
novel human genes with triplet repeats may be 
candidates for these neuropsychiatric disorders. 

Note added in proof. A gene on chromosome 2 has 
recently been linked to familial colon cancer^^. 
These cancers contain multiple instabilities in simple 
sequence repeats, including triplet repeats [ident- 
ified using our probe CTG-B37 (Ref. 30 and Fig. 2)], 
which may contribute to the phenotype^°. A genetic 
approach may uncover mechanisms underlying in- 
stability of triplet repeats. 



Selected references 

1 Beckmann. J. S. and Weber, J. L (1992) Cerjomics 12. 
627-631 

2 Davies, K. E. (1992) Nature 356, 15 

3 Richards, R. I. and Sutheriand, G. R. (1992) Cell 70. 
709-712 

■4 Caskey, C. T., Pizzuti, A., Fu, Y-H., Fenwick, R. G. and 

Nelson, D. L. (1992) Science 256, 784-788 
5 Harper, P. S. (1989) Myotonic Dystrophy 2nd edn, W. B. 

Saunders 



TINS, Vol. 16, No. 7, 1993 



259 



Acknowledgments 

We thank Stylianos 
Antonarakis, 
Martin Schalling. 
Colin Stine, 
Be Wiehnga, 
Ben Oostra, 
Leonard Pinsky, 
Colin Fletcher, 
Susan Chamberlain, 
Kenneth Fishbeck, 
James Cusella and 
MarcyMacDonaldfor 
helpful discussions or 
sharing data prior to 
publication, or both. 
We thank Solomon 
Snyder and Susan 
Fo/stem for reading a 
draft of tne 
manuscnpt 
Figure 1 
IS adapted with 
permission from 
figures by 
G. Suthedand, 
Thomas Caskey and 
Kenneth Fischbeck 
and from 
data from the HD 
Collaborative Group. 
We thank Deborah 
Holifield for expert 
assistance with word 
processing. 



R. and Harper, P. S. (1973) Neurology 23, 



6 Dyken. P 
465-^73 

7 Penrose. L S. Ann. Bugenet. 14, 125-132 

8 Howeler. C. J., Busch. H. F. M., Ceraedts, J. P. M., Niermeiier 
M. F. and Staal, A. (1989) Brain 112, 779-797 

9 Mahadevan, M. et al. (1992) Science 255, 1253-1255 

10 Brook. J. D. et al. (1992) Ce//68, 79^08 

11 Fu, Y-H. etai (1992) Science 255, 1256-1258 

12 Harley, H. G. etai (1992) Nature 355, 545-548 

13 Tsilfidfs, C. MacKenzie, A. E., Mettler, G., Barcelo' J and 
Korneluk, R. G. (1992) Nature Cenet. 1, 192-195 

14 Hagerman, R. J. and Silverman, A: C. (1991) Fragile X 
Syndrome, Johns Hopkins University Press 

15 Sherman. S. L. etai. (1985) Hum. Genet. 69, 3289-3299 

16 Freund, L. S.. Retss. A. L., Hagerman, R. and Vinogradov, S 
(1992) Arch. Cen. Psychiat. 49, 54-60 

17 Fu, Y-H. etai. (1991) Cells?, 1047-1058 

18 Kremer, E.J. etai. i^99^) Science 252, 1711-1714 

19 Verkerk. A. J. M. H. etai. (1991) Cell 65, 905-914 

20 Oberle. I. etai. (1991) Science 252, 1097-1102 

21 Yu, S. et al. (1992) Am. J. Hum. Cenet. 50, 968-980 

22 Hinds, H. L. etai. (1993) Nature Cenet 3, 36-43 

23 Kennedy, W. R., Alter. M. and Sung, J. H. (1968) Neurology 
18, 671—680 

24 La Spada, A. R.. Wilson, E. M., Lubahn. D. B.. Harding A E 
and Fischbeck, K. H. (1991) Nature 352. 77-79 

25 La Spada, A. R. etai. (1992) Nature Cenet. 2, 301-304 

26 Doyu. M. etai. (1992) Ann. Neurol. 32, 707-710 

27 Folstein. S. E. (1989) Huntington's Disease. A Disorder of 
Families, Johns Hopkins University Press 

28 Ridley, R. M.. Frith. C. D., Crow, T. J. and Conneally P M 
(1988) / Med. Genet. 25, 589-595 

29 Ridley. R. M., Frith. C. D.. Farrer. L. A. and Conneally, P M 
(1991) Med. Genet. 28, 224-231 

30 Li, S-H., Mclnnis, M. G., Margolis, R. L, Antonarakis S E 
and Ross, C. A. (1993) Genomics 16, 572-579 

31 Huntington's Disease Collaborative Research Croup (1993) 
Cell 72, 971-983 

32 Grieder, C. and Blackburn, R. (1987) Ce//51. 887-898 

33 Schlotterer. C. and Tautz, D. (1992) Nucl. Acids Res 20 
211-215 

34 Richards, R. I. etai. (1992) Nature Genet 1, 257-260 

35 Harley, H. G. etai. (1991) Am. J. Hum. Cenet 49, 68-75 

36 Morton, N. E. and Macpherson, J. N. (1992) Proc. Natl Acad 
Sci. USA 89. 4215-^1217 

37 Chakravarti, A. (1992) Nature Genet 1, 237-238 

38 Shelbourne. P. etai. (1992) Hum. MoL Genet 1, 467-473 

39 Brunner, H. G. etai. (1993) New Engl. J. Med 328, 476-480 

40 O'Hoy, K. L. et at. (1993) Science 259, 809-812 

41 Hansen, R. S., Gartler. S. M.. Scott. C. R.. Chen. S. H and 
Laird, C D. (1992) Hum. Mot. Cenet 1. 571-578 

42 Bell. M. V. etai. (1991) Celt 64, 861-866 

43 Biancalana. V. etai. (1992) Hum. Mot. Cenet 1 255-258 

44 Pieretti. M. etat. (1991) Ce//66, 817-822 

45 Gedeon. A. K. etai. (1992) Nature Cenet 1, 341-344 

46 DeBoulle. K. etat. (1993) Nature Cenet 3, 31-35 

47 Reiss. A. L., Aylward, E., Freund, S. F., Joshi, K. and Byran 
R. N. (1991) Ann. Neurol. 29, 26-32 

48 Novelli, G. etat. (1992) Am. J. Hum, Cenet 51, A40 

49 Fu, Y-H. etat. (1993) Science 260, 235-238 

50 Jansen, G. etat. (1992) Nature Cenet 1, 261-266 

51 Ahringer, J. and Kimble. J. (1991) Nature 349, 346-348 

52 Pinsky. L. etat. (1992) Am. J. Hum. Cenet 51, Suppl. A41 

53 Pinsky, L. etat. (1992) Clin. Invest Med. 15, 456-472 

54 Batch, J. A. et at. (1992) Hum. Mot. Cenet 1, 497-503 

55 Adier, A. J.. Danrelsen, M. and Robins, D. M. (1992) Proc 
Natl Acad. Sci. USA 89, 11660-11663 

56 Landschuiz, W. H., Johnson, P. F. and McKnight, S. L (1988) 
Science 240, 175^1764 

57 Beal. M. F., Hyman. B. T. and Koroshetz, W. (1993) Trends 
Neurosci. 16, 125-131 

58 Mendlewicz, J. and Rainer, J. D. (1974) Am. J. Hum. Cenet 
26. 92-101 

59 Simpson, S. G., Folstein, S. E., Meyers. D. A. and DePaulo 
J. R. (1991) Am. J. Psychiat 149, 1660-1665 

60 Mclnnis. M. G. et al. Am. J. Hum. Cenet (in press) 

61 Cross-National Collaborative Group (1992) ; Am Med 
Assoc, 268. 3098-3105 

62 Kendler, K. S. and Gruenburg. A. M. (1984) Arch. Gen. 



Psychiat 41. 555-564 

63 Gottesman, I. I. (1991) in Schizophrenia Genesis p 96 
Freeman 

64 Pulver, A. E. etat. (1992) Br J. Psychiat 160 65-71 

65 Schut J. W. (1950) Arch. Neurol. Psychiat 63, 535-568 

66 Carpenter, S. and Schumacher, G. A. (1966) Arch. Neurol 
14, 82-94 

67 Zoghbi, H. Y. etai (1988) Ann. Neurol. 23, 580-584 

68 Diaz. G. O., Fleites. A. N., Sagaz. C. and Auburger. G. (1990) 
Neurology 40, 136^1375 

69 Landa, R. etat. (1992) Psychol. Med. 22, 245-254 

70 Riggins, G. J. etai. (1992) Nature Genet 2 186-191 

71 Wharton, K. A.. Yedvobnick, B.. Finnerty, V. G and 
Artavanis-Tsakonas. S. (1985) Ce//40. 55-62 

72 Duboule, D., Haenlin, M„ Galliot, B. and Mohier E (1987) 
MoL Cell. Biol. 7, 2003-2006 

73 Kao, C. C. al. (1990) Science 248, 1646-1650 

74 Kadonaga. J. T., Courey. A. J., Ladika. J. and Tjian, R. (1988) 
5c/ence 242. 1566-1570 

75 Courey. A. J. and Tjian. R. (1988) Cell 55. 887-898 
76,Haynes. S. R.. Rebbert. M. L. Mozer, B. A., Forquignon F 

and Dawid, I. B. (1987) Proc. Natl Acad. Sci. USA 84 
1819-1823 

77 Mermod. N.. O'Neill. E., Kelly, T. J. and Tjian, R. (1989) Celt 
58, 741-753 

78 Ma. J. and Ptashne. M. (1987) Ce//48. 847-853 

79 He, X. and Rosenfeld, M. G. (1991) Neuron 7 183-196 

80 Campeau, S. etat. (1991) Brain Res. 565, 349-352 

81 Cole, A. J., Saffen. D. W., Baraban, J. M. and Worley P F 
(1989) Wafiyre 340, 474-476 

82 Guerini, D. and Klee, C. B. (1989) Proc, Natl Acad. Sci USA 
86.9183-9187 

83 Edwards, A., Civitello, A., Hammond. H. A. and Caskey C T 
099^) Am. J. Hum. Cenet 49, 746-756 

84 Schalling. M., Hudson, T. J., Buetow. K. H. and Housman 
D. E. Nature Cenet (in press) 

85 Li. S. H., Mclnnis. M. G.. Margolis. R. L., Antonarakis, S E 
and Ross. C. A. (1992) Am. J. Hum. Cen. 51, A41 

86 Lee. C. C. Panzer, S. W. and Caskey, C. T. (1992) Am J 
Hum. Cenet 51. A 101 

87 Duyao, M. P. at. Nature Genet (in press) 

88 Ross. C. A. et at. Soc. Neurosci. Abstr (in press) 

89 Peltomaki. P. etat. (1993) Science 260, 810-812 

90 Aaltonaen. L. E. etat. (1993) Science 260, 812-816 



Erratum 

In the article The role of positron 
emission tomography in the assess- 
ment of human neurotransplantation' 
by Guy V. Sawie and Ralph Myers 
(May 1993, Vol. 16, pp. 172-176), 
Figures 2 and 3 were transposed rela- 
tive to their legends. Thus, the coronal 
section that appears as Figure 3 should 
appear above the legend for Figure 2 
and the parasaggital section that 
appears as Figure 2 should appear 
above the legend for Figure 3. We 
apologise to Drs SawIe and Myers, and 
to the readers for this error. 
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Summary 

Anticipation, or increasing severity of a disorder across successive generations, is a genetic phenomenon with an 
identified molecular mechanism: expansion of unstable trinucleotide repeat sequences. This study examined 
anticipation in familial schizophrenia. Three generations of siblines from the affected side of families selected 
for unilineal, autosomal dominant-like inheritance of schizophrenia were studied (n = 186). Across generations 
more subjects were hospitalized with psychotic illness (P<.0001), at progressively earlier ages (P<.0001), and 
with increasing severity of illness (P<,0003). The results indicate that anticipation is present in familial 
schizophrenia. These findings support both an active search for unstable trinucleotide repeat sequences in 
schizophrenia and reconsideration of the genetic model used for linkage studies in this disorder. 



Introduction 

The clinical observation of anticipation — i.e., inherited 
illness that becomes more severe across successive gen- 
erations — has recently been found to have a molecular 
basis: expanding GC-rich trinucleotide repeat sequence 
mutations (Harper et al. 1992; Sutherland and Richards 

1992) . In fragile X syndrome (Verkerk et al. 1991), 
myotonic dystrophy (Fu et al. 1992), spinobulbar mus- 
cular atrophy (Brook et al. 1992), spinocerebellar atro- 
phy type 1 (Orr et al. 1993), and Huntington disease 
(Huntington's Disease Collaborative Research Group 

1993) , increasing severity of illness, earlier age at onset, 
and/or increasing proportion of ill individuals in suc- 
cessive generations are associated with longer trinucleo- 
tide expansions. Schizophrenia is another neuropsychi- 
atric disorder that may display this anticipation 
phenomenon and that therefore may have familial 
forms caused by an unstable trinucleotide repeat. 

Schizophrenia is a severe disorder characterized by 
social withdrawal and psychotic symptoms, such as de- 
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lusions and hallucinations. The illness has a variable age 
at onset, often beginning in early adulthood and result- 
ing in lifelong disabilities in social and occupational 
functioning. Evidence from family, twin, and adoption 
studies, including those using reliable diagnostic crite- 
ria (Lowing et al. 1983; Kendler et al. 1985), strongly 
supports a genetic etiology for schizophrenia (Gottes- 
man and Shields 1982). However, the mode of inheri- 
tance for schizophrenia is not readily identifiable and is 
proposed to involve interacting genes (Risch 1990). In 
families with the illness, reduced penetrance and vari- 
able expression are commonly found. Other psychotic 
disorders, of lesser severity, and schizotypal personality 
traits such as social isolation, odd communication, and 
extreme suspiciousness are conditions likely reflecting 
variable expression of genetic susceptibility to schizo- 
phrenia (Gottesman and Shields 1982; Lowing et al. 
1983; Kendler et al. 1 985). These factors, along with the 
possibility of genetic heterogeneity and the practical 
difficulties of studying an illness with a significant sui- 
cide rate and suspicious, socially isolated individuals, 
combine to make schizophrenia a challenging disorder 
for linkage studies (Bassett 1991). Strategies to over- 
come these difficulties include focusing on familial 
schizophrenia where inheritance is consistent with 
Mendelian patterns, using reliable diagnostic methods, 
highly polymorphic DNA markers, and lod-score 
methods that model the complexities of the inheri- 
tance. Linkage studies to date, using informative fami- 
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lies and models based on Mendelian inheritance, have 
yielded significant lod scores but no replicated positive 
results (Bassett 1991). 

Dynamic modifications of classical patterns of ge- 
netic transmission, such as anticipation (Mott 1911) 
and genomic imprinting (differential expression of ge- 
netic material depending on parental origin of the gene 
and the underlying molecular mechanisms), may ex- 
plain the complex genetics of schizophrenia and other 
major mental illnesses (Mclnnis et al. 1991; Flint 1992). 
For example, trinucleotide repeats can cause reduced 
penetrance and variable expressiony^askey et al. 1992; 
Sutherland and Richards 1992), by existing in a premu- 
tation form, by reductions in repeat size, or by somatic 
mutation in early embryogenesis (Lavedan et al. 1993). 
The unstable nature of trinucleotide repeats also pro- 
vides a possible mechanism for the high mutation rates 
proposed for schizophrenia^later and Cowie 1971). If 
there were evidence of anticipation in schizophrenia, 
screening for triplet repeat mutations would become a 
rational option for gene localization studies, and modi- 
fication of the genetic model used in linkage studies 
would need to be considered. The current study inves- 
tigated whether anticipation was present in a familial 
schizophrenia sample participating in a linkage study. 

Subjects and Methods 

Subjects were members of eight extended noncon- 
sanguineous families participating in a genetic linkage 
study of familial schizophrenia. Local psychiatrists 
identified prospective pedigrees segregating schizophre- 
nia. Families were selected for large size, availability of 
two or more generations of adults, and apparent unilin- 
eal, autosomal dominant-like inheritance of schizo- 
phrenia and genetically related disorders. Bilineal fami- 
lies with evidence, from family or collateral history, of 
schizophrenia or other nonaffective psychotic dis- 
orders on both sides were excluded. Further details of 
the original ascertainment and assessment for the link- 
age study are described elsewhere (Bassett et al. 1993). 
Since families were ascertained in their entirety, pro- 
band status could be assigned to all affected subjects. 
Therefore no subjects were excluded from analyses. 

Subjects only from the affected side of each family 
were taken into account, to determine sibling sets (sib- 
lines) in the index generation (IG) (n = 13 siblines), 
parental generation (PG) (n = 10 siblines), and grand- 
parental generation (GG) (n = 8 siblines). The affected 
side was defined by (1) a parent hospitalized with psy- 
chosis (four cases), (2) an aunt/uncle hospitalized with 



psychosis (four cases), (3) a parent/aunt/uncle with 
schizotypal traits (seven cases), or (4) a sibship linking 
two affected nuclear families (six cases in two extended 
families) (see fig. 1). In two cases in the GG, the affected 
side could not be determined, and the smaller of the 
unaffected maternal or paternal siblines was arbitrarily 
selected. There were three instances of unknown pater- 
nity. In two of these cases, the maternal line was af- 
fected. In the third case, neither the mother nor her five 
siblings were affected, and this GG sibline was not in- 
cluded in the analysis. 

Family-history information was obtained for each 
subject from three or more family members by using 
the Family History-Research Diagnostic Criteria (FH- 
RDC) method (Andreasen et al. 1977). Genealogical re- 
cords were used to confirm dates of birth and death. 
Medical records were searched back to 1866 for evi- 
dence of psychiatric hospitalization. Because the sub- 
ject families originated and seldom moved from a cir- 
cumscribed region of Canada, and because the one 
psychiatric hospital available until the 1980s consis- 
tently maintained a comprehensive file-card system of 
recording admissions, virtually complete ascertainment 
of psychiatric hospitalization was assured. Records 
were collected for all subjects with a history of psychiat- 
ric hospitalization. Living subjects participating in the 
linkage study were directly interviewed by a psychiatrist 
(Bassett et al. 1993). Diagnostic folders containing the 
family history and, if present, medical records and in- 
terview data were reviewed independently by two psy- 
chiatrists (A.S.B. and W.G.H.), one of whom (W.G.H.) 
was blind to the pedigree structures. A consensus life- 
time Research Diagnostic Criteria (RDC) diagnosis for 
psychotic disorders, age at first hospitalization for a 
psychotic illness, and presence of two or more RDC 
schizotypal traits were recorded. Psychotic disorders 
included schizophrenia (n = 25), schizoaffective dis- 
order (« = 13; 12 mainly schizophrenic type and 1 
other), unspecified functional psychosis (« = 4), mania 
with psychosis (« = 1 ), and depression with psychosis (« 
= 1). Schizophrenia and schizoaffective disorder were 
approximately equivalent in severity in the current sam- 
ple (Bassett et al. 1993) and were considered together in 
the current study. Individuals with psychotic disorders 
not severe enough to require hospitalization, as well as 
subjects with schizotypal traits, were combined in a 
single schizotypal group, because of small numbers in 
each category. 

Of the 209 subjects in the affected siblines, 23 were 
excluded from the analyses. One IG subject had not 
attained the age of 15 years, considered a minimum age 
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Figure I Threepedigrees of the eight fam- 
ilies studied, illustrating anticipaiion in familial 
schizophrenia. IG, PG, and GG siblines on the 
affected side are shown. The numbers below indi- 
viduals indicate their age at first hospitalization GG 
for psychotic illness. An unblackened square (□) 
denotes an unaffected male; an unblackened cir- 
cle (O) denotes an unaffected female; a black- 
ened square (■) or circle (•) denotes hospitaliza- 
tion for a psychotic disorder; and a 
half-blackened square (El) or circle O) denotes 
schizotypal conditions. Sex and birth order of 
some individuals have been changed to protect 
confidentiality. A slash (/) through the symbol |q 
denotes that the individual is deceased. The box 
outlines a single affected lineage from a family 
connected at the grandparental level. 



of risk for psychotic illness (Gottesman and Shields 
1982). Five subjects (2 PG and 3 GG) had moved, and 
collateral information was insufficient to determine 
hospitalization status; and 17 subjects (3 IG, 5 PG, and 
9 GG) died before the age of 40 years. Thirteen died in 
infancy or childhood, two in war, one in an accident at 
work, and one of unknown cause; none were suicides. 
Most new cases of schizophrenia may be expected be- 
fore age 40 years (Gottesman and Shields 1982). Data 
on the remaining 186 subjects were examined for antici- 
pation, in three ways. First, the rates of hospitalization 
for psychotic disorders and the rates of schizotypal 
conditions were compared across generations by using 
analyses. Second, to assess severity of illness, subjects 
were assigned the following ratings: hospitalized with 
schizophrenia or schizoaffective disorders— 3; hospital- 
ized with other psychotic disorders— 2; schizotypal— 
1; and unaffected- 0. Means for each generation were 
compared using the one-way analysis of variance (AN- 
OVA), including correction for multiple tests of signifi- 
cance with the Student-Newman-Keuls procedure. 
Third, age at first hospitalization for psychosis was as- 
sessed using the life-table method of survival analysis 
for 1-year intervals. Homogeneity of survival curves 
over the generations was examined using the Wilcoxon 
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test. The analysis was performed assuming (1) no differ- 
ential mortality between affected and unaffected and 
(2) hospitalization rates independent of chronological 
time (e.g., 1920 vs. 1970). Observations ended at the 
subject's current age or age at death. Covariates tested 
were sex and transmission patterns (maternal/paternal). 

Results 

Demographic characteristics of the sample and re- 
sults indicating anticipation are presented in table 1. As 
for other illnesses demonstrating anticipation, expres- 
sion of illness varied between members of a sibship 
(fig. 1). 

Rotes and Distribution of Illness 

There were significantly more subjects hospitalized 
for psychosis across successive generations (x^ ^ 16.84, 
P<.0001, 2 df). Most had schizoph renia or schizoaffec- 
tive disorders (IG, n = 30; PG, « = 8; and GG, « = 0). Of 
the six subjects with less severe disorders— unspecified 
functional psychosis (« = 4), psychotic mania (« = 1), or 
psychotic depression (w = l)--four were in PG or GG. 
Subjects with the least severe illnesses (schizotypal con- 
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Table I 



Characteristics of the Sample, by Generation (n = 186) 




IG 


PG 


GG 




86 (38) 


62 (24) 


38 (21) 


Mean size of sibline (SD) 


6,61 (4.27) 


6.20 (3.61) 


5.43 (4.20) 


Mean age of living* (SD) 


40.61 (8,74) 


65.55 (12.96) 


71.80 (8.26) 


Mean age at death** (SD) 


44.25 (3.20) 


57.15 (14.58) 


77.35 (9.34) 


Mean age at first hospitalization (SD) 


26.16 (8.28) 


34.00(17.28) 


41,33 (15,89) 


Subjects hospitalized for psychotic illness .... 


32 (37.21%) 


9(14.52%) 


3 (7.89%) 


Subjects with schizotypal conditions 


8 (9.30%) 


8 (12.90%) 


8(21.05%) 



• No. of living subjects: IG, 82; PG, 42; and GG, 10. 
No. of dead subjects with known age at death: IG, 4; PG, 20; and GG, 20. Eight other GG subjects had 
ages at death that were less precisely known, e.g., "in their 70s." The mean shown did not change signifi- 
cantly when these subjects were included using estimated ages at death. 



ditions) were twice as common in the GG as in the IG 
(21.05% vs. 9.30%); however, across the three genera- 
tions, the result was not significant (x^ = 3.24, P = .20, 
2 df). When these subjects with schizotypal conditions 
were included with hospitalized subjects, there were 
still significantly more affected subjects in the youngest 
generation (x^ = 6.86, P - .032, 2 df). 

Because of the possibility of compounded error with 
families being connected at the parental or grandparen- 
tal level, analyses were rerun with a single affected lin- 
eage from each of the seven kindreds that had data 
from all three generations, with the largest siblines be- 
ing selected (see fig. 1). The same results were found 
using this subsample of 151 subjects, for both hospital- 
ization rates (x^ = 12.30, P<.002, 2 df) and hospitaliza- 
tion plus schizotypal rates (x^ = 9A6, P<.009, 2 df). 
Excluding subjects who died before age 40 or moved 
away (n = 22) could have influenced the study's find- 
ings, since most were in the two senior generations. 
When results with these subjects and with penetrance 
estimated maximally at 100% were considered, so that 
one-half {n - W) would have been hospitalized with 
psychotic disorders, the results for hospitalization rates 
would still have remained significant {y} = 10.02, 
P<.01, 2 df). Data were also reanalyzed examining the 
effect that increasing hospitalization rates over time 
would have on the results. Secular trends of hospitaliza- 
tion rates for specific illnesses were not available. There- 
fore, arbitrary increases were tested, by adding 100% 
more subjects (n = 3) to the hospitalized GG group and 
50% more (n = 4,5) to the PG group. Under these con- 
ditions, a significant increase in the rate of psychosis 
requiring hospitalization would continue to be present 
over the generations (x^ = 7.75, P<.05, 2 df). 



Severity of Illness 

Means for four-point severity-of-illness ratings for 
the three generations were as follows: IG (« = 86), 1,19; 
PG (n = 62), 0.58; and GG (« = 38), 0.37. Severity of 
illness significantly increased over the generations (F 
= 8.39, 2 df, P = .0003). Pairwise comparisons using 
the Student-Newman-Keuls test revealed that the signif- 
icant difference was between the IG and the PG and 
GG. PG and GG severity means were not significantly 
different from each other. 

Age at First Hospitalization 

The survival curve (fig. 2) shows that subjects were 
first hospitalized for psychosis at progressively younger 
ages, across generations (x^ = 26.76, P = .0001, 2 dO- 
Sex of subject was not a significant covariate to genera- 
tion, for age at first hospitalization (increment x^ 
= 0.40, P = .53, 1 df). All of the GG and most (68%) of 
the PG subjects had achieved an age of 55 years or 
more, by which time they would have been virtually 
through the age at risk for schizophrenia (Gottesman 
and Shields 1982). 

Imprinting 

There were equal rates of maternal and paternal 
transmission (six and seven cases, respectively) from the 
PG to the IG. The mean age at first hospitalization for 
maternal transmission (PG to IG) was 25.00 years (SD 
5,94 years), and that for paternal transmission was 
27,64 years (SD 10.63 years), a nonsignificant difference 
(t = .83, P = .41). Grandparental to parental transmis- 
sion was predominantly maternal (seven of nine cases), 
with one unknown and only one example of paternal 
transmission. Sex of transmitting parent was not a sig- 
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Figure 2 Sur\'ival cur\'es for age at first hospitalization for 
psychotic illness, comparing IG (a), PG {•), and GG (■). Annual 
percent sun'iving without hospitalization is plotted for every 5th 
year. Observation of individuals ended at their age of death or, if they 
were living, at their current age. 

nificant covariate in the survival analysis examining age 
at first hospitalization (increment = 0-^^) ^ = --^^j 
2df). 

Discussion 

The results suggest that familial schizophrenia ex- 
hibits anticipation. All of the families studied showed 
this phenomenon, manifest as increasing rates of hospi- 
talized psychotic illness, w^orsening severity of illness, 
and/or earlier age at onset, across successive genera- 
tions (fig. 1). These findings are consistent with differ- 
ences in rates of hospitalization and age-at-onset data 
for parent-child pairs in studies of schizophrenia over 
the century (Mott 1910; Kay 1963; Penrose 1971; De- 
cina et al. 1991). In each of these studies, rates of hospi- 
talization for psychosis were lower for antecedent gen- 
erations, and age at onset for parents was significantly 
later than that for offspring. Investigations of ancestors 
and extended families also support these findings 
(Karlsson 1966; Odegaard 1972; Wetterberg and 
Farmer 1991). As well, less severe psychotic illnesses 
(e.g., affective disorders) are consistently more com- 
mon in the generation antecedent to schizophrenic 
probands (Slater and Cowie 1971; Bleuler 1978). These 
results complement reported morbid risk of schizophre- 
nia for parents, which is almost half that for siblings 
(Gottesman and Shields 1982). While alternative rea- 
sons, including selection biases such as reduced fertility 
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in earlier-onset schizophrenia, have been proposed to 
explain these clinical observations, they are all consis- 
tent with the phenomenon of anticipation in schizo- 
phrenia. 

In contrast, findings from the current study are only 
suggestive of sex-specific differences in the transmis- 
sion of schizophrenia. However, the possibility of an 
excess maternal over paternal transmission in schizo- 
phrenia is consistent with trends found both recently 
by others (Sharma et al. 1993) and in studies of large 
data sets in the older literature (Penrose 1971; Slater 
and Cowie 1971). Fertility may be especially low in 
male patients with schizophrenia (Gottesman and 
Shields 1982), and in the current study this could be the 
reason for the high rate of maternal transmission from 
the GG lines to the PG lines. If this were the case, 
however, one would have expected predominantly ma- 
ternal inheritance from the PG to the IG lines; but ma- 
ternal and paternal rates were equal. Comparable trans- 
mission patterns have been found to be due to greater 
variation of trinucleotide repeat length after female 
meiosis in myotonic dystrophy (Lavedan et al, 1993). 
The effect may be subtle, requiring larger samples to 
demonstrate imprinting in schizophrenia. 

There are several possible biases that can explain re- 
sults that indicate anticipation (Penrose 1948). First, 
subjects could have died before expressing the muta- 
tion. However, all of the GG and most of the PG lived 
beyond age 55 years. Also, significantly diflFerent hospi- 
talization rates across generations remained in the 
current study, even when half of those who had moved 
or died before age 40 years were assigned aflFected sta- 
tus. Second, reduced fertility of individuals with earlier 
onset of schizophrenia could cause preferential ascer- 
tainment of parents with later onset. This bias should 
be minimized in the current study, because (1) few par- 
ents were affected with psychosis and (2) siblines were 
large in all three generations, providing multiple oppor- 
tunities for detection of affecteds, regardless of their 
fertility. Third, subjects in the IG could have been too 
young to yet express a late-onset form of psychosis, 
which would attenuate the age-at-onset findings. Since 
one-half of the subjects in the IG were over age 40 
years, most were beyond the period of highest risk. 
Even if more new cases of psychosis subsequently 
arose, this would most likely occur in the IG and would 
only serve to strengthen the present study's findings 
with respect to differential rates of illness. 

In contrast to major depression (Gershon et al. 1987), 
there is no evidence for a cohort effect in schizophre- 
nia. However, secular trends, such as improved detec- 



Anticipation in Schizophrenia 



869 



tion, that could, over time, lead to higher hospitaliza- 
tion rates and/or younger age at first hospitalization 
for psychosis are important to consider, since these 
could have influenced the principal findings of the 
current study. In the literature (Mott 1910; Kay 1963; 
Penrose 1971; Decina et al. 1991), examination of ages 
at first hospitalization did not reveal secular trends to 
younger age over the century. In the current study, ar- 
bitrary increases in hospitalization incidence assigned 
to the PG or GG did not change the observation of 
anticipation. Specific factors that may influence secular 
trends in hospitalization, including drug abuse and psy- 
chosocial stressors, do not appear to have played a role 
in the sample studied. Only three hospitalized subjects 
in the IG had a history of stimulant or hallucinogen use, 
drugs that may in some cases precipitate a psychotic 
illness. On the basis of direct interviews, it appeared 
that psychosocial stressors endured by PG and GG, 
such as the World Wars and Great Depression, were 
more severe than those faced by the IG. 

Another possibility is that the family-history method 
tends to underestimate rates of psychiatric disturbance, 
particularly in relatives who are dead or less known 
(Andreasen et al. 1977). The consequence could be that 
actual rates for schizotypal traits could be higher than 
those found, particularly for PG and GG. However, 
results for severe illness requiring hospitalization would 
remain unchanged. Another factor that could have 
compounded errors in the PG and GG was the use of IG 
siblines connected at the parental or grandparental 
level. However, both the fact that results remained the 
same when only one ascending line from each kindred 
was examined and the fact that at least 13 affected 
siblines had resulted from eight originating GG lines 
support the finding of anticipation. 

A limitation that is important to consider in studies 
of common illnesses is the potential for assortative 
mating to cause an apparently increased prevalence of 
illness in offspring. This possibility was minimized by 
selecting unilineal pedigrees with no evidence of schizo- 
phrenia or related disorders in the married-in person, 
their siblings, or parents. Although individuals marrying 
in could have been nonexpressing carriers of the dis- 
order with nonexpressing close relatives, the likelihood 
appears small that assortative mating could account for 
the results in the current study. Because the families 
studied were selected because of their autosomal domi- 
nant-like inheritance and large sibships, the results may 
not be generalizable to schizophrenia in the general pop- 
ulation, although they are consistent with observations 
from large population-based samples (Penrose 1971; 



Bleuler 1978). Also, the schizophrenia in the subject 
families may be a particularly severe form, which could 
exaggerate the findings. However, the mean age at on- 
set for the IG is similar to others' results (Mott 1911; 
Gottesman and Shields 1982; Decina et al. 1991; 
Sharma et al. 1993). Data on specific symptom patterns 
suggest that the familial schizophrenia in the present 
families is comparable in nature and in severity to sam- 
ples drawn from the general population (Bassett et al. 
1993). 

Despite the possible biases and limitations, the 
weight of evidence from both the current investigation 
and the literature is consistent with the finding of antici- 
pation in familial schizophrenia. Other families with 
schizophrenia should be examined for anticipation and 
possible accompanying maternal imprinting phenom- 
ena, to confirm the current study's findings. Although 
published pedigrees consistently show evidence of an- 
ticipation (Karlsson 1966; Wetterberg and Farmer 
1991), complete ascertainment of other large kindreds 
with contemporary reliable diagnostic assessments 
would be useful. However, the most exciting possibili- 
ties — and the confirmation of the clinical observations 
of the current study — lie in the search for expanding 
trinucleotide repeats and other DNA sequence muta- 
tions in schizophrenia. New methods becoming avail- 
able to detect these mutations (Orr et al. 1993; Schal- 
ling et al. 1993) will complement and may accelerate the 
search for pathological genes in linkage studies. In ad- 
dition, the current study has implications for the ge- 
netic model used in linkage studies. Parameters, particu- 
larly penetrance, which would vary according to 
generation, may need to be modified to reflect the ef- 
fects of anticipation and possibly imprinting. In the 
light of clinical evidence for anticipation, these strate- 
gies represent real promise for deciphering the genetics 
of schizophrenia. 
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Summary 

Anticipation refers to the increase in disease severity or decrease in age at onset in succeeding generations. This 
phenomenon, formerly ascribed to observation biases, correlates with the expansion of trinucleotide repeat 
sequences (TNRs) in some disorders. If present in bipolar affective disorder (BPAD), anticipation could provide 
chies to its genetic etiology. We compared age at onset and disease severity between two generations of 34 
liuitineal families ascertained for a genetic linkage study of BPAD. Life-table analyses showed a significant 
decrease in survival to first mania or depression from the first to the second generation {P < .001). 
Intergenerational pairwise comparisons showed both a significantly earlier age at onset {P < .001) and a 
significantly increased disease severity (P < .001) in the second generation. This difference was significant under 
each of four data-sampling schemes which excluded probands in the second generation. The second generation 
experienced onset 8.9-13.5 years earlier and illness 1.8-3.4 times more severe than did the first generation. In 
additional analyses, drug abuse, deaths of affected individuals prior to interview, decreased fertility, censoring 
of age at onset, and the cohort effect did not affect our results. We conclude that genetic anticipation occurs in 
this sample of unilineal BPAD families. These findings may implicate genes with expanding TNRs in the 
g..:t:tic etiology of BPAD. 



Introduction 

The term *'anticipation" describes an inheritance pat- 
tern within a pedigree wherein disease severity increases 
or age at onset decreases in successive generations (Ho- 
weler et al. 1989; Harper et al. 1992), Early in this cen- 
n anticipation was reported in myotonic dystrophy 
(DM) (Fleischer 1918) but was later discounted as a bias 
of ascertainment (Penrose 1948). Recent clinical inves- 
tigations have supported the presence of anticipation in 
DM (Howeler et al. 1 989), noting that it was likely to be 
inherent in the mode of transmission. Anticipation has 
also been shown in fragile X syndrome and helps ex- 
plain the apparent non-Mendelian features of its inheri- 
tance (the Sherman paradox; Sherman et al. [1985]). 
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Recently, anticipation in DM (Mahadcvan er al. 1992) 
and fragile X (Fu et al. 1991) was directly associated 
with expansion of trinucleotide repeats (TNRs) at the 
disease locus (Caskcy er al. 1992). 

Some early observers claimed that the clinical phe- 
nomenon of anticipation was present in the inheritance 
of psychiatric disorders (Morel 1857; Mott 1910, 
1911). However, no studies have addressed anticipa- 
tion in psychiatric disorders since Kraepelin (1921) sepa- 
rated schizophrenia from bipolar affective disorder 
(BPAD) early in this century. BPAD family (Winokur et 
al. 1982; Rice et al. 1987), twin (Bertelsen et al, 1977), 
and adoption studies (Mendlewicz and Rainer 1977) 
demonstrate genetic transmission, but the mode of in- 
heritance remains uncertain (Rice et al. 1987). Anticipa- 
tion could be one of the factors influencing the mode 
of transmission. 

In view of (1) the new data describing the molecular 
basis of anticipation and (2) the persistent uncertainty 
about the mode of inheritance of BPAD, we examined 
34 families ascertained for a genetic linkage study of 
BPAD for evidence of anticipation. Our findings sug- 
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gest that anticipation occurs in this sample of families, 
as demonstrated by an earlier age at onset and greater 
severity of illness in successive generations. 

Subjects and Methods 

Ascertainment 

Subject selection and evaluation for the genetic link- 
age study of BPAD are described in detail elsewhere 
(Simpson et al. 1992) and will be summarized here. 
Clinics and inpatient units in Baltimore and Iowa City 
have been screened for treated bipolar I (BPl) or bipolar 
II (BPII) probands with two or more sibs or with one sib 
and one parent with an affected phenotype by family 
history. BPl disorder is defined by the presence of 
"mania," a period of severely elevated mood and activ- 
ity, in BPII disorder the periods of elevated mood or 
activity arc less severe and, while noticeable to the pa- 
tient or family, are not incapacitating (Coryell ct al. 
1984). For the purposes of ascertainment, the affected 
phenotype included any bipolar, unipolar depressive, or 
schizoaffective disorder. Eighteen hundred probands 
have been screened. After a multi-informant family psy- 
chiatric history was obtained, probands with an af- 
fected phenotype in both parental lines and probands 
whose key relatives were unavailable or uncooperative 
were excluded, leaving 125 families meeting the ascer- 
tainment criteria. 

Evoluotion 

In these 125 families, probands and all available first- 
and second-degree relatives were interviewed by a psy- 
chiatrist using the Schedule for Affective Disorders and 
Schizophrenia — Lifetime version (SADS-L) (Endicott 
and Spitzer 1978). All diagnoses conformed to the Re- 
search Diagnostic Criteria (RDC) (Spitzer et al. 1975). 
Inten'iews have been completed on 599 subjects. For 
the anticipation analysis, the affected phenotype in- 
cluded (1) major depression, single episode (MDS), (2) 
major depression, recurrent (MDR), (3) BPil, with re- 
current major depression, (4) BPl, and (5) schizoaffec- 
tive disorder, manic type (SA). 

Study Sample 

For the present study we excluded from the 125 
ascertained families (1) families with clinical evidence of 
bilineality, on the basis of either the presence of an 
affected phenotype or even a single episode of major 
depression in both parental lines (the additive effect of 
disease genes from both parental lines could create the 
appearance of anticipation); and (2) families without at 
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least one inter\iewed, affected individual in ea^n of 
two successive generations (necessary for the compari- 
son of affected individuals in successive generations). 
This left 34 families in the present study. 

The older generation (Gl) — which usually consisted 
of parents, aunts, and uncles of the proband — included 
63 affected and 78 unaffected individuals. The younger 
generation (G2), which usually consisted of sibs and 
first cousins of the proband, included 107 affecrcJ and 
20 unaffected individuals. We did our analysis ...ider 
the assumption of autosomal dominant transmission of 
the affected phenotype. The ratio of affected to unaf- 
fected G2 individuals (107:20) represents only those 
family members directly interviewed by a psychiatrist 
and includes the 30 G2 probands, hi four families the 
proband and his or her sibs constituted Gl, while the 
proband's children, nieces, and nephews constituted 
G2. We obtained extensive family history information 
on all members of the G2 generation, regardless of 
whether they were interviewed, but we did not include 
the uninterviewed relatives in the anticipation analysis. 
When we use family history and direct interview data to 
diagnose the relatives in the G2 generation (excluding 
probands), the ratio of affected to unaffected individ- 
uals (74:57) docs not differ significantly from the ex- 
pected autosomal dominant ratio (x^ = 2.18, P > .10). 

Anticipation Measures 

The measures of anticipation were (1) age at onset 
and (2) episode frequency. The age at onset was defined 
as the first episode of RDC major depression or mania 
(whichever was earlier). These events are assessed, with 
high reliability, by the SADS-L (Egeland et al. 1987). 
Episode frequency, considered a measure of disease se- 
verity, was calculated on the basis of the total number 
of episodes of mania and depression divided by the 
total number of years from onset to interview (-Ve at 
interview minus age at first mania or depression). 

Sampling Schemes 

A difficult statistical problem in family studies is that 
relatives do not constitute strictly independent data 
points. For example, their onset ages for a given genetic 
disorder are often more closely correlated than onset 
ages of nonrelatives. Since there is no simple solution to 
this problem, we performed analyses on four di^^-^ent 
samples of pairs selected from our families. Each sam- 
pling scheme is subject to different criticisms, bur, if ^i" 
schemes lead to the same conclusions, the results are 
unlikely to be the consequence of weaknesses of any 
one sampling scheme. 
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In all sampling schemes, probands were excluded 
when part of G2. This avoids any bias arising from the 
ascertainment of probands when their illness became 
severe enough to require treatment. 

The four sampling schemes were as follows: 

/. Random pairs. — One randomly selected affected 
r.:::cive from Gl was paired with one randomly selected 
affected relative from G2 in each family, to form a ran- 
dom pair (// = 34 pairs). This is a statistically conser\a- 
tive scheme, but it fails to use all affected individuals. 
By including childless members of Gl, this scheme, as 
well as scheme 111 below, addresses the bias arising from 
the possibility that severely ill members of Gl will not 
produce children. 

//. Random transmitting pairs. — One randomly se- 
1- :ed affected parent was paired with one randomly 
selected affected child, in four families there was no 
affected parent. For these families only, the unaffected 
parent was arbitrarily assigned an onset age equal to the 
age at inter\iew and an episode frequency of zero. 
Thirty-four randomly selected transmitting pairs were 
generated. Although this statistically conser\'ativc 
scheme fails to include childless members of Gl, it is 
the most direct test of the hypothesis that the illness is 
!• -easing in severity when passed from parents to their 
cnildren. 

///. Ail possible pairs,— Each affected member of Gl 
was matched with each affected member of G2, to gen- 
erate several pairs per family, with a total of 196 possi- 
ble intergenerational pairs. Although this scheme uses 
some data points more than once, it tests the robust- 
ness of the findings based on the random pairs. It also 
includes all affected members of Gl, even if they pro- 
duced no children. 

All possible transmitting pairs.— Each affected 
parent in Gl was matched with each of his or her af- 
fected children, to generate 97 pairs. Similarly to 
scheme III, this scheme uses some data points more 
than once but tests the robustness of the findings based 
on the random transmitting pairs. 

Statistics 

Onset age was compared between all members of Gl 
a:'.u all members of G2 by using life-table analysis and 
the Gehan's generalized Wilcoxon test. Pairwise inter- 
generational differences were compared by using the 
nonparametric Wilcoxon matched-pairs statistic, since 
onset age and illness frequency all showed highly non- 
Gaussian distributions. The Cox proportional hazards 
model was used to analyze the relationship between 




Age Interval (yr) 

Figure I Cumulative survival to first mania or major depres- 
sion of affected individuals in the older generation (Gl) compared 
with the younger generation {Gl). 



generation and onset age while controlling for the ef- 
fect of birth cohort. 

All analyses were performed by using CSS-Statistica 
(StatSoft). The Cox model was analyzed using PECAN, 
version 2.4 (Wisconsin Clinical Cancer Center). 

Results 

Life-Table Analysis of Onset Age 

Figure 1 shows survival to first mania or major de- 
pression for members of Gl versus members of G2. As 
a group, G2 experienced a significantly earlier onset of 
illness compared with Gl (Gchan Wilcoxon test statis- 
tic = -5.75388, P < .001). Fifty percent of affected 
subjects in G2 became ill by age 23 years, as compared 
with age 37 years for members of G 1 , a difference of 1 4 
years. 

Pairwise Comparisons 

in pairwise comparisons, members of G2 experi- 
enced a significantly earlier illness onset than did 
members of Gl (tabic 1). This significantly earlier onset 
for G2 was obsen ed under all four sampling schemes. 
Members of G2 fell ill 8.9-13.5 years earlie^'r than did 
members of G 1 . Overall, 79% of random pairs, "9''^o of 
random transmitting pairs, 83% of all possible pairs, 
and 80% of all possible transmitting pairs showed antici- 
pation based on age at onset. 

Table 1 also shows that members of G2 had a signifi- 
cantly higher frequency of episodes than did members 
of Gl. Again, a significant difference was seen regard- 
less of sampling scheme. Members of G2 experienced 
an episode of illness 1.8-3.4 times as often as did 
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Table I 

Pairwise Comparisons 
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<.001 


<.001 
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.16 
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.17 


.32 
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2.84 


6.76 


5.88 


P 




<.00l 


<.001 


<.00l 


<.001 



■* W'ilcoxon ni.uclied-pairs (ei.t. 



members of Gl. Anticipation based on episode fre- 
quency was seen in 76% of random pairs, 76% of ran- 
dom transmitting pairs, 75% of all possible pairs, and 
79% of all possible transmitting pairs. 

Changes in their age at onset or episode frequency 
which were consistent with anticipation were seen in 
95.3% of all pairs. There was no significant effect of the 
sex of the affected parent on the intergenerational dif- 
ferences in either age at onset (P > .16) or episode 
frequency {P > .27). 

In the course of data analysis it was observed that, in 
26 of 34 random transmitting pairs, there was a change 
in the form of the affected phenotype (e.g., from an 
MDR parent to a BPI offspring). In 18 of these pairs 
either the diagnosis of unaffected, MDS, or MDR in 
the parent became a diagnosis of BPII or BPI in the 
offspring or there was a shift from BPII in the parent to 
BPI in the offspring. In the remaining eight pairs the 
reverse occurred. When subjected to the sign test, this 
trend toward "phenotypic progression" did not reach 
significance (P = .08). 

Correction for Possible Biases 

We attempted to correct for several possible biases in 
post hoc analyses of the random pairs. Since substance 
abusers may have an earlier onset and more severe af- 
fective illness, the analyses were repeated after the five 
pairs containing a substance abuser in G2 were elimi- 
nated. All differences remained significant (P < .01). 

Another potential bias arises from the possibility that 



severely affected individuals die young and thus are less 
likely to be available for evaluation as members of Gl. 
We determined that 5 of 34 deceased members of Gl 
were likely to have been affected, on the basis of family 
history from multiple informants. We assigned these 
individuals an onset age of 19 years and an episode 
frequency score of .315, the median values for members 
of G2, and repeated the analyses. All differences re- 
mained significant (P < .001). 

Since subjects cannot be obser\'ed to fall ill later rr..:n 
the age at which they w^re examined, G2 may have a 
paucity of late-onset cases simply because members of 
G2 are younger, thus creating a false impression of an- 
ticipation. In an attempt to address this bias, we as- 
signed the 20 unaffected members of G2 an onset age 
of 61.7 years, equal to the median age of contact for 
Gl. Despite this correction, onset age remained signifi- 
cantly lower in G2 versus Gl (P < .001). 

Finally, we addressed the cohort effect (Gersho; et 
al. 1987), which concerns the reported earlier onset of 
major affective disorder for members of cohorts born 
since 1945. This phenomenon could create the appear- 
ance of anticipation, since members of G2 tended to 
belong to later birth cohorts. To address this bias, we 
performed a bivariate analysis using the Cox propor- 
tional hazards model (Cox 1972), with birth cohort and 
generational membership as covariatcs and with age at 
first mania or major depression as the cndpoint. ' "> 
jects were divided into three birth cohorts: (1) those 
born before 1945 (58 in Gl, 1 1 in G2); (2) those born 
between 1945 and 1960 (5 in Gl, 72 in G2); and (3) 
those born after 1960 (0 in Gl, 24 in G2). The results 
indicated a significant correlation of generational mem- 
bership with age at onset (P = 0.7658, Z ^ 2.212, P 
= .027) even when a significant effect of birth cohort 
was controlled for (cohort 2 vs. cohort 1 : P = 0.8355, Z 
- 3.493, P = .013; cohort 3 vs. cohort 1: p = 2.0: Z 
= 4.522, P < .001). 

Discussion 

In this study, we report data supporting the occur- 
rence of anticipation in 34 unilineal bipolar families. 
Under each of four different data-sampling schemes, 
members of the younger generation manifested a signifi- 
cantly more severe form of illness, on the basis of nc:' 
onset and episode frequency. This phenomenon was 
obser\'ed in the majority of our families, with 95% of 
families showing anticipation based on either onset age 
or episode frequency. 

The appearance of anticipation can be created by 
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several different observation biases. We attempted to 
address these biases in our study design and in post hoc 
analyses. The study design was based on two measures 
of anticipation and on four schemes for sampling the 
data. Thus it is not likely that our findings are due to 
the particular weaknesses of any one measure or sam- 
pli : : scheme. Further, the study design sought to elimi- 
nate bilineal families, thus minimizing the possible ad- 
ditive effect of disease genes from both parental lines 
converging in the younger generation. Finally, the study 
design incorporated sampling schemes which included 
childless members of the older generation, thus mini- 
mizing the bias of severely ill individuals having fewer 
children. 

All results remained significant when the possible im- 
pn/r of substance abuse, differential mortality among 
members of the older generation, censoring of onset 
age, and the cohort effect were addressed in post hoc 
analyses. Thus we conclude that our findings are not 
artifacts arising from these observational biases. 

In an attempt to control for the effect of an individ- 
ual's being raised by an affected parent, we examined 
our data by using sampling scheme II (random transmit- 
ting pairs). Wc found that, in 14 of the 34 families, the 
G2 individual had onset of illness prior to that of the 
G: parent. In these families there was significant evi- 
dence of anticipation, on the basis of age at onset and 
episode frequency. This suggests that a genetic effect 
underlying our observation is likely, although a more 
definitive answer to this issue might be provided by 
adoption studies. 

Two further biases deserve mention. Older subjects 
may report a later than actual age at onset because they 
forget earlier episodes (Rice et al. 1987; Warshaw et al. 
1 '1). We addressed but did not wholly overcome this 
problem by collecting data in a scmistructured inter- 
view with a psychiatrist, wherein subjects' reports were 
exposed to medical cross-examination. If our findings 
were due primarily to forgetting, we would expect a 
smaller generational difference for events less likely to 
be forgotten. However, when we analyzed age at first 
mania, defined by RDC as an ''incapacitating event" 
(and thus less likely to be forgotten), there was an even 
1 ."ger generational difference in onset age (GI =40 
years of age, G2 = 21 years of age; P < .001). 

Our ascertainment scheme, which required three af- 
fected members (including proband) per family, may 
have introduced some bias into the analysis. Such fami- 
lies arc not likely to be representative of families af- 
fected by bipolar disorder in the community. They are 
likely to be enriched for genetic cases. There may also 



be an overrepresentation of severely ill individuals in 
G2, because of a tendency to preferentially recruit fami- 
lies that exceed the ascertainment criteria. Simply re- 
moving the proband from analyses would not over- 
come this bias. However, we did not require that 
affected sibs of the proband meet any severity criterion; 
indeed, 80% of the affected sibs were never hospital- 
ized for their affective disorder, and 46% never received 
treatment. 

Finally, there is the problem, first described by 
Penrose (1948), that pairs consisting of a parent with 
early-onset illness and a child with late-onset illness are 
unlikely to be ascertained by any study, since there is 
such a large span of time separating the two onset 
events. The effect of this bias is difficult to measure. 
However, we expect that the effect would be small in 
our sample. First, we have ascertained some early-onset 
parents; in 24% of our families, the affected parent had 
an onset of illness at ^24 years of age. Second, we have 
already observed most of the illness onsets that are 
likely to occur in the second generation. In this genera- 
tion 70% (excluding probands) were affected at inter- 
view, including 1 3 subjects with onset ages ^30 years of 
age. Thus it is unlikely that the Penrose bias is exerting a 
large effect on this sample. 

We conclude that anticipation appears to occur in 
our sample of families with BPAD. These preliminar>' 
findings arc of potential importance, since the presence 
of anticipation in even a subset of families with BPAD 
may explain in part the complex pattern of inheritance 
of the disorder, including its variable penetrance and 
expressivity. Our observations need to be confirmed in 
a group of families ascertained without regard to family 
history. If confirmed, our findings may implicate genes 
containing expanding TNRs in the etiology of BPAD. 
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Summary 

Prior studies describing the relationship between CAG 
hZ7 « Huntington disease (HD) 

liave focused on affected persons. To further define th 
reIat.onsh,p between CAG repeat size and age at ont 
0 HD. we now have analyzed a large cohort of affected 
and asymptomatic at-risk persons with CAG expansion. 
Th.s cohort numbered 1,049 persons, including 321 at 
nsk and 728 affected individuals with a CAG size of 
2^-12 repeats. Kaplan-Meier analysis has provided 
curves for determining the likelihood of onset at a given 
age, for each CAG repeat length in the 39-50 range 
The curves were significantly different {P< .0005), with 
rclanveiy narrow 95o/„ confidence intervals (95^ CI) 
(-10 /o) Penetrance of the mutation for HD also was 
cxammed Although complete penetrance of HD wa 
observed for CAG sizes of ^42. only a proportion o 
tl'ose w,th a CAG repeat length of 36-41 showed s^gn 
or symptoms of HD within a normal life span. These 

hnlT ^°"«'-"i"8 'he likelihood of 

b .ng affected, by a specific age. with a particular CAG 
size, and they may be useful in predictive-testing pro- 
grams and for the design of dinical trials for persons at 
mcreased risk for HD. F"»uns ar 



Introduction 

Himtington disease (HD) is a progressive, neurodegener- 

o vchit ""'^ "^^^^^ disturbances, 

r/«i H ^y^^Pfo"". and cognitive decline (Hayden 
1981; Harper 1991). Although the mean age a onset's 
-40 years of age, 5%-10% of cases have a juvenile 
onset before age 20 years, whereas late onset (at >50 
years of age) occurs in -20% (Myers et al. 1985) The 
disease ,s inexorably progressive, with a duration o 
-15-20 years (Hayden 1981; Harper 1991). 
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The mutation associated with clinical manifestations of 
he disease >s an unstable CAG trinucleotide expansion in 
exon 1 of a novel gene located on 4pl6.3 (Huntington's 
Disease Collaborative Research Group 1993) Whereas 
unaffected persons have a range of 6-35 CAG renParc 
wKh 99% of such individuals hiving <30 repet^, pe J 
affected with HD have CAG sizes of 36-121 to 
(Kremer et al. 1994; Rubinsztein et al. 1996) 

In 1994 The Huntington Study Group, a consortiuni 
of investigators and practitioners across North America 
defined the ranges of CAG repeat length for normal' 
persons as being <30, whereas a CAG of >38 indicated 
a high probability for development of HD (Huntington 
Study Gi-oup 1995). A repeat size of 30-37 was consid- 
ered indetermmate" as to whether the patient would 
develop HD at some time in the future. These guidelines 
were put into place pending more definitive information 
concerning the relationship between CAG length nod 
chnical phenotype of the disease. There have been v ,-i- 
ous interpretations of the range of CAG repeat sizes 
on normal and HD chromosomes (Andrew et al 1993- 

et'ai° 1 n ^'"'"^"'■^ ^"'^ ^^^g^ 1^93; De Rooij 
« al. 1993; Duyao et al. 1993; Goldberg et al.l993^; 
Huntington Disease Collaborative Group 1993; Mac- 
Millan et al 1993; Norremolle et al. 1993; RubiLzte" 

11 ■ iQQ?n " ^^'"^ " 1993; Ziihlke 

et al. 1993; Benitez et al. 1994; Legius et al. 1994- Trot- 
,7 i- 1994; Claes et al. 1995; Soong and "g 
1995), because of different techniques in measuring 
CAG repeat length, leading to difficulty in comparing 
results between studies. Originally the DNA analysis 
consisted of PGR measurement of the CAG repeat, 
which included the CAG and an adjacent CCG repeat 
(Goldberg et al. 19936; Huntington's Disease Collabo- 
rative Group 1993; Vaides et al. 1993). However, since 
!i 'T''?' polymorphic (Rubinsztein et 

ai 1993; Andrew et al. 1994), a more accurate PGR 
echnique which measures only the CAG repeat, has 
been developed and is the preferred technique for pre- 
dictive and diagnostic testing (Riess et al. 1993; Warner 
et al. 1993; Kremer et al. 1995). This has allowed clari- 
fication of the ranges of CAG repeat length in persons 
affected with HD. An analysis of > 1,1 00 affected per- 
sons clearly has shown that HD has not occurred with 
a CAG length <36 repeats (Kremer er al. 1994- Ru- 
binsztein et al. 1996). 
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However, for patients who are informed that they 
have inherited the CAG repeat expansion in the HD 
range (>35), the question often changes from whether 
they will develop HD to when the disease will manifest. 
jsJumerous studies have described a significant inverse 
relationship between CAG repeat size and age at onset 
(Andrew et al. 1993; Barron et al. 1993; Craufurd and 
Dodge 1993; De Rooij et al. 1993; Duyao et al 1993; 
K"/rremolle et al. 1993; Rubinsztein et al. 1993; Simp- 
son et al. 1993; Snell et al. 1993; Stine et al. 1993; 
Telenius et al. 1993; Ziihlke et al. 1993; Ashizawa et 
al. 1994; Kieburtz et al. 1994; Legius et al. 1994; Novel- 
letto et al. 1994; Trottier et al. 1994; Lucotte et al. 1995; 
Masuda et al. 1995; Ranen et al. 1995; Yapijakis et al. 
1995; Brandt et al. 1996), with the CAG repeat length 
being invoked to account for --50% of variation in age 
at onset (Andrew et al. 1993; Stine et al. 1993; Legius 
er 1994). However, these prior studies of the relation- 
ship between age at onset and CAG repeat size have not 
included asymptomatic, at-risk individuals with CAG 
length in the affected range. This makes it impossible to 
take into account the number of asymptomatic persons 
with a particular CAG at a specific age, and thus it 
prevents a complete understanding of the relationship 
between CAG size and age at onset of HD. Both to 
further explore the relationship between CAG repeat 
si: ^ and the age at onset of clinical manifestations of 
HD and to determine the risk of developing HD by a 
certain age in a patient with a particular CAG repeat 
size, we analyzed CAG size in large numbers of affected 
and asymptomatic at-risk individuals. Strict criteria 
were used to determine the age at onset by incorporation 
of careful clinical follow-up of patients. 

Inclusion of the asymptomatic at-risk individuals with 
expanded CAG repeat lengths, along with use of the 
m'-^'-e accurate CAG repeat measurement, allowed us to 
develop estimates of the probability of developing HD, 
by a particular age, with a given CAG size. The large 
numbers of individuals in our database also enabled us 
to clarify the ranges of normal and HD-associated CAG 
repeat sizes. 

HD previously has been considered to be 100% pene- 
trant (Huntington 1872; Hayden 1981; Harper 1991). 
However, recent data (Nance 1996; Rubinsztein et al. 
19^6) suggest that, on rare occasions, HD may indeed 
be nor fully penetrant. This data set also has allowed us 
to study the penetrance of HD at different CAG sizes 
and to clearly show that penetrance is modified by CAG 
size. 

Subjects and Methods 

Subjects 

DNA samples from persons with the diagnosis of HD, 
tl.-ir at-risk family members, and unrelated controls 
have been collected in our laboratory since 1984. To 



date, we have collected information on 4,934 individu- 
als from 1,193 families. A total of 1,593 individuals 
in our database are affected with HD, and 2,244 are 
asymptomatic but at risk (i.e., first- or second-degree 
relatives of an affected individual). Families from Can- 
ada and many parts of the world, including families of 
European, Asian, black South African, Arab, and South 
American origin, are represented. Samples from all fam- 
ily members of affected individuals were recruited ac- 
tively, especially asymptomatic and older first- and sec- 
ond-degree relatives of affected individuals, as part of 
the predictive-testing program at the University of Brit- 
ish Columbia clinical genetics clinic. 

For the purpose of this study we used those individu- 
als with CAG expansions of the upper allele that were 
>28, comprising 728 affected and 321 asymptomatic 
at-risk individuals from 473 families, whose age at onset 
or oldest age while still asymptomatic could be ascer- 
tained. An accurate assessment of the age at onset was 
performed through both a retrospective review of pa- 
tient charts and telephone interviews with patients, fam- 
ily members, genetic counselors, and physicians. Age at 
onset was defined as the first time at which a patient 
had either neurological or psychiatric symptoms that 
represented a permanent change from the normal state. 
The age used for analysis of all asymptomatic individu- 
als was the oldest age when his or her clinical status was 
last directly confirmed, either at the genetics clinic in 
Vancouver or by the local, attending physician. Particu- 
lar attention was paid to confirmation of current age and 
clinical status of all asymptomatic, at-risk individuals in 
the HD database who were >65 years of age. 

DNA Analysis and Assessment of CAG Repeats 

DNA was extracted from leukocytes by standard pro- 
cedures (Kunkel et al. 1977). The CAG repeat was as- 
sessed for all samples by exclusion of the CCG repeat, 
by use of PGR analysis with primers that flanked either 
only the CAG repeat (Kremer et al. 1995) or both the 
CAG repeat and the CCG repeat (Goldberg et al. 
1993b), followed by analysis with primers that flank 
only the CCG repeat (Andrew et al. 1994). Phasing of 
the CCG repeat was performed by pedigree studies, 
when necessary. The CAG repeat size was assessed by 
comparison with sequenced clones of known CAG size. 

Data Analysis 

For each year of age and CAG size, all individuals, 
including both those asymptomatic at-risk and those 
affected, were taken into account to calculate the cumu- 
lative probabilit}' of having onset of HD by that age, by 
use of Kaplan-Meier survival analysis (Splus software 
version 3.1 release 1, 1992; AT&T). Since we had no 
affected persons with CAG length <36 repeats, individ- 
uals with a CAG repeat length >35 were considered to 
be a cohort at risk, from birth to either neurological or 




At risk: 

No. of individuals j 

Average current age (years) 59 

Minimum asymptomatic age (years) 69 

Maximum asympromatic age (years) 69 



SD of current age (vears) 
Affected: 

N'o. of individuals 
Average age at onset (years I 
Minimum age at onset (years) 
Maximum age at onset (years) 
SD of age at onset (years) 
Total no. of individuals 



•9 

XA 21 



8 11 

42 50 38 41 

IS 22 n 6 

83 53 80 

17 12 20 



0 0 0 0 0 



8 11 18 9 



28 13 5 9 4 4 3 
32 29 27 28 27 25 26 
13 16 23 18 19 20 23 
52 40 33 40 39 33 29 
3 



8 9 



0 0 1 
65 
65 
65 
XA 
31 13 



4 2 8 
57 60 61 
53 35 39 
60 S4 71 
3 35 11 
8 II 21 



64 
56 
35 
84 
10 
111 



98 
48 
29 
65 
7 
129 



92 
43 
28 
65 
8 

116 



95 
41 

25 

58 48 
7 6 



63 58 
37 36 
23 19 
45 
5 



39 31 
32 31 
17 17 
45 45 
7 6 



123 76 63 48 35 



13 
26 
16 
34 
6 
16 



5 


321 


19 


39 


16 




23 




3 


lo 


60 


728 


20 


41 


4 


4 


35 


84 


8 


13 


65 


1,049 



sored). The Kaplan-Meier survival curves were com- 
pared by use of log-rank statistics. 

There were only 32 individuals with 36-38 CAG re 
peats, and there were a total of 65 individuals with 

ev^l ^u^- CAG lengths were 

xcluded from the survival analysis, since the small num- 
bers of individuals at each particular CAG size precluded 
rigorous statistical analysis. 

Results 

The distribution of affected and asymptomatic at-risk 
.ndmduals with CAG repeats >28 is shown in table 
1. There were no affected individuals with <36 CAG 
repeats. The ages of asymptomatic persons with a CAG 
repeat size of 30-35 are shown in table 2. A total of 

Table 2 

S-fe of 30-3?" °f A'y'"P»<»"atic Individuals with CAG Repeat 



Age 
(years) 



No. OF Individuals with 
CAG Repeat Size of 



30 
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35 



Total 



0-19 
20-29 
30-39 
40-49 
50-59 
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65-69 
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BS-S9 
90-95 
Total 



1 
3 
1 
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4 

5 
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0 

2 
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1 

2 
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15 
11 
17 
17 
4 
0 
3 
4 
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963 individuals (728 affected) had CAG repeats >^5 
Of these, 866 individuals (90%) from 445 families had 
CAG repeat lengths of 39-50 repeats. The linear associ- 
anon between log age at onset and log CAG size was 
significant {P < .001), with an value of 73 

The cumulative probability of onset, at 5-year inter- 
vals, for a given CAG repeat is shown in table 3, with 
the comp ete age distribution for each CAG being shown 
m figure 1. The mean 95% CI for the probability of age 
at onset with a given CAG repeat by a particular . 
was narrow (±10%). 

As the CAG repeat length increased from 39 to 50 
there was a significant increase in the probability of 
onset (/> < .0005) by a given age. For example, whereas 
an individual with 40 CAG repeats had only a 13% 
probability of having onset by age 45 years, this in- 
creased to 32 /o for someone with 42 CAG repeats, 73% 
for 44 repeats, and 100% for a person with 46 CAG 
repeats (table 3 and fig. 1). 

There were no individuals with a CAG repeat length 
>41 who remamed asymptomatic at >56 years of age. 
This indicated that clinical manifestation of the disease 
was fully penetrant within a normal life span for this 
CAG repeat range. 

There were, however, several individuals with CAG 
repeats of 36-41 who had not manifested with symp- 
toms of HD within a normal expected life span (table 
4). For example, there were two male individual.. 78 
and 85 years of age, with 36 CAG repeats, and there 
was a male 75 years of age with 39 CAG repeats; all 
ot them were asymptomatic. Furthermore, there was a 
female with 38 CAG repeats and a male with 40 repeats 
who were not affected until age 84 years, and there was 
a male with 41 repeats who was not affected until age 
75 years (table 4). All these individuals were of Cana- 
dian origin, except for the 78-year-old male with 36 
repeats, who was of American origin. However, : ' ere 
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Table 3 

Cumulative Probability of Onset at Different Ages, for a Given CAC Repeat Size 

Cumulative PROBABiLi-n' (95% CI) for CAG Repeat Size of 



Age of Subject 39 



15 



40 41 42 



tyg^'-s) jrJ = 21) (« = 111) (« = 98) 



43 44 45 

(« = 123) {n = 76) 



65 1.00 (NA) 



47 48 49 

= 48) („ = 35) („ = 30) 



{rt = 129) 



30 

,r .02(.05-.00) 

7^ (.05-.00) .02 (.05-.00) ,05^ (.09- 01)^ 

-07^ <-20-00)^ .08 (.13-.02) .12 (.18-.05) .14 ( 20- 07) 

Vn (.19-.05) .21 (.30-.12) .32 (,41- 23) 

t-^^-OO) 21 (.30-,12) .38 (.48-.26) .58 (.66-.47) 

J, (.46-.25) .55 (.64-,42) .81 (.87-,71) 

Ji -61 (.70-47) .77 {.85-.65) .99 (1.00- 91) 

•36(.59-.00) .80(.88-.67) .88M.94-.78)^ 1.00 (NA) * 

.68 (,87-.20) .90 (.96-77) .94 (.98-.85) 

(.94-.28)^ .95 (.99-.82) .98 (1.00-.88) 

oO 

l.OOMNA)^ 



46 
{n = 63) 



20 

.03 (.07-.00) 

ii .01 (.02-.00) .05 (.10-.00) .06 (.12- 00) 

(-lO-.Ol) .04 (.08-.01) .17 (.25-.08) .10 (.17-.02 

.18 (.26-,ll) .22 (.30-.14) .37 (.47-.24) .41 ( 52- 27 

,i (.65-.45) .73 (.80-.62) .91 (.96-.80) 1.00 (NA) 

^0 .87(.93-.78) .89(.94-.80) l.OOMNA)^ 

.93 (.97-.85) .96 (.99-.88) 

60 .99^ (1.00-.9l)^ 1.00^ (NA)^ 



50 
{n - 16) 



.07 (.15-.00) 

20 .04 (.lO-.OO) .06 (.13-.00) .30 (.45-.12) .19^ (.36- 00)^ 

.16 (.25-.04) .15 (.26-.02) .41 (.57-.20) .39 (.59-.09) 

(.49-.20) .46 (.61-.26) .53 (.68-.30) .73 (.89- 32 

-64 (.76-.45) .78 (.88-.57) .77 (.89-.53) 1.00 (NA) 

^0 .89 (.95-.72) .89 {M~.69) .95^ (.99-.70)^ 

2f 1-00 (NA) 1.00 (NA) 1.00 (NA) 



nn7.nr T \^"^'J1T' '^^^ ^"P""' ^'^'^ 1 >'^^^ (CAG repeat sizes 39-41 

and 50), or 2 years less (CAG repeat sizes 44 and 45) than the stated interval. 
Value is for an individual 71 years of age. 
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were no instances of nonpenetrance with a CAG length 
' repeats. 

A difference of a single CAG repeat length has a 
significant effect on the expected age at onset for an 
individual. There was a significant linear trend between 
CAG repeat length and median age at onset (? < .001; 
^ = .96), with the median age at onset decreasing by 
3.4 years (±0,2) for each CAG length increase in the 
39-50 range, as shown in table 5. For example, al- 
though only 50% of persons with 40 CAG repeats will 
be iffected by age 59 years, this decreases to age 37 
years for 45 CAG repeats and to age 27 years for 50 
CAG repeats. 



To assess the effect of any possible bias introduced 
by the inclusion of multiple individuals from 445 fami- 
lies, we randomly selected only two individuals and re- 
peated the analysis. There was no significant difference 
in the results obtained for 39-49 CAG repeats, sug- 
gesting that we did not introduce any obvious bias into 
our database by including two or more individuals from 
any one family for this CAG range. 

Discussion 

Using a large cohort ascertained uniformly and ana- 
lyzed by use of the same accurate measures for CAG 
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Figure 1 Representations of cun^ulative probability of being affected, at a particular age, for a CAG of 39-50. Error bars represent 95% CI. 



size, this study has clearly shown that CAG repeat length 
IS the major determinant of age at onset in HD. By 
assessing the CAG size alone, we were able to predict 
the likelihood that an individual would be affected by 
a particular age, with relatively narrow 95% CI (±10%) 
for the vast majority of persons being tested. This study 
confirms the prior assessments of ranges of CAG repeat 
length in affected persons and clearly shows that reduced 
penetrance for HD may occur only for a CAG repeat 
length <42. The degree of association between CAG 
size and age at onset was significant (P < .001), with 



an of .73, which is higher than prior estimates of -.6 
(Andrew et al. 1993; Duyao et al. 1993; Ranen ei al. 
1995), probably because of definitive determination of 
CAG size and confirmation of clinical status. 

The inverse relationship between the increased CAG 
repeat size and age at onset for other diseases associated 
with CAG expansion in novel genes is well documented 
(La Spada et al. 1992; Jodice et al. 1994; Koide et al. 
1994; Nagafuchi et al. 1994; Ranum et al. 1994; Ko- 
mure et al. 1995; Maciel et al. 1995; Maruyama et al. 
1995; Takiyama et al 1995). For HD, since the rr- .ge 
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Table 4 

Estimation of Penetrance of CAG Expansion in HD Gene, 
by CAC Repeat Size 



CAG 

Repeat 

Size 



No. of 
Affected 
Individuals 



No. of Unaffected 
Individuals, Age >75 
Years (Males) or >80 
Years (Females) 



Total 



<29-35 

36 

37 

38 

39 

40 

41 

>41 



0 
1 
4 
2 
8 
64 
74 
575 



9 
2 
0 
1 
1 
1 
1 
0 



9 
3 
4 

3^ 
9' 
65 
75 
575 



Including individuals reported by Rubinsztein et al. (1996). 



of age at onsets for a particular CAG repeat size in 
the HD gene was very broad, most authors, including 
ourselves, have recommended against the use of the 
CAG repeat size to predict the age at onset for an indi- 
vidual patient (Andrew et al. 1993; Barron et ah 1993; 
Craufurd and Dodge 1993; De Rooij et al. 1993; Duyao 
et nl. 1993; MacMillan et al. 1993; Norremole et al. 
lV:'j; Simpson et al. 1993; Stine et al. 1993; Novelletto 
et al. 1994; Trottier et al. 1994). In this article, including 
asymptomatic at-risk as well as affected individuals has 
allowed development of accurate survival curves to pre- 
dict the probability that an individual will be affected 
with HD by a certain age. These curves were based on 
both affected and asymptomatic at-risk individuals with 
CAG repeat sizes of 39-50, which represents 90% of 
individuals in our database who have a CAG repeat size 
in .he range of ^36. The survival curves were signifi- 
cantly different for each CAG repeat size studied, lend- 
ing further support to the importance of the CAG repeat 
size as the predominant factor in determining the age at 
onset. Although these probability curves cannot be used 
to predict the particular age at onset for an individual, 
this analysis may have clinical utility by providing esti- 
mates of symptom-free survival to an individual seeking 
additional information in a predictive-testing program. 

ata generated from this study also may have signifi- 
cant implications for the design of clinical trials for new 
therapeutics. The aim of new therapies in HD is to slow 
or stop the progression of HD in affected persons and 
to delay or prevent onset in persons with CAG >35 
repeats. An appropriate design of clinical trials for in- 
creased-risk individuals will need to take into account 
the expected age at onset of HD for a particular person, 
to determine the potential efficacy of therapy. For exam- 
pl all persons with a CAG repeat length of 46 would 
be expected to manifest with symptoms by 45 years of 
age, with a median age at onset of 36 years (see table 3 



and fig. 1). Extension of age at onset beyond this age 
could indicate a significant therapeutic effect of a partic- 
ular drug. Alternatively, shifting of the median age at 
onset to >36 years also might indicate a therapeutic 
effect. These data indicate the importance of having suf- 
ficient persons with a particular CAG size in a drug trial, 
allowing for more rapid ascertainment of a beneficial 
effect. 

The results of this study, however, should be interpre- 
ted with caution. Although we used a highly accurate 
methodology to calculate the CAG repeat size, this anal- 
ysis cannot be extrapolated immediately to other labora- 
tories, because of possible interlaboratory variability in 
the PGR assays. However, it is somewhat reassuring in 
this regard that comparison of CAG repeat lengths on 
the same samples, between laboratories with significant 
experience in assessment of CAG size, revealed few dif- 
ferences in assessment of CAG repeat size (Marshall and 
Huntington Study Group 1996; Rubinsztein et al. 
1996). 

Another cautionary note is that these results were ob- 
tained by use of affected and unaffected persons from 
families with HD. Therefore, these data may not apply 
equally to asymptomatic individuals in the general popu- 
lation who have no relative with HD but who are found 
to have CAG repeats in the range seen in affected persons 
with HD. However, it is indeed extremely rare to find 
such individuals in the general population, hi addition, 
it should be recognized that a potential bias in the data- 
base is underrepresentation of asymptomatic persons at 
increased risk for developing HD. This would result in 
underestimation of the age at onset in this study. How- 
ever, this analysis did include 661 affected and 205 
asymptomatic individuals at risk, and, although this does 
not represent a complete assessment of CAG in a defined 
population of affected and at-risk individuals, there has 
been no systematic bias in ascertainment of data. 



Table 5 




Median Age at Onset 






Median Age at Onset' (95% CI) 


CAG Repeat Size 


{years) 


39 


66 (72-59) 


40 


59 (61-56) 


41 


54 (56-52) 


42 


49 (50-4S) 


43 


44 (45-42) 


44 


42 (43-40) 


45 


37 (39-361 


46 


36 (3~-35i 


47 


33 (35-31. 


48 


32 (34-301 


49 


2S (32-25) 


50 


27 (30-24) 



"Age by which 50% of individuals will be affected. 
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Our results support the recent findings of Rubinsztein 
et al. (1996), who investigated individuals with 30-40 
repeats. We confirm that the lower limit of CAG repeat 
size in individuals who manifest with HD is 36. This 
lo%ver limit is supported both by the fact that there were 
31 at-risk individuals with a CAG repeat size of 35, 
including one man who was 93 years old, and that all' 
these persons were asymptomatic (table 2). It is now 
justified that the CAG size of 30-35 no longer need 
be considered part of the indeterminate range. These 
mdividuals can be informed that there is no clear docu- 
mentation of any person manifesting with HD who has 
CAG repeats in this range. However, the risk that off- 
spring will develop HD may be increased, particularly 
if the transmitting parent is a male (Goldberg et al 1995- 
Chong et al. 1997). 

Our database includes five males who were asymp- 
tomatic at age &75 years and one female with onset at 
84 years of age (table 4). Penetrance is defined as the 
proportion of individuals with a specified genotype who 
show the expected phenotype under a defined set of 
environmental conditions, which, in this instance, is a 
normal expected life span (King et al. 1992). In the past, 
HD was presumed to be 100% penetrant, with all carri- 
ers of the HD expansion manifesting the disease (Hun- 
tington 1872; Hayden 1981; Harper 1991). The finding 
of reduced penetrance has been raised previously else- 
where (Nance 1996; Rubinsztein et al. 1996). Using on- 
set at 3=75 years for males and 3=81 years for females 
as bemg beyond the normal life span (Statistics Canada 
1995), our analysis of survival curves indicates that there 
IS complete penetrance with a CAG repeat size of &42. 
Furthermore, it is apparent that reduced penetrance may 
occur within the range of 36-41 CAG repeats. Clearly, 
these data need validation in other independently ascer- 
tained large groups of patients, since the numbers are 
too small to allow for meaningful penetrance estimates 
for each specific repeat size. However, it is obvious that 
there is a trend to increasing penetrance with increasing 
repeat length in the 36-41-repeat range: <90% for 39 
CAG repeats and 99% for 41 CAG repeats (table 4). 

The data presented in this article provide significant 
new information for persons and professionals involved 
m predictive-testing programs. Accurate genotyping 
provides assessment as to whether a patient has inherited 
DNA changes associated with HD, but it also can help 
to define the likelihood of being affected at a certain 
age in those individuals with a CAG repeat of 39-50. 
Furthermore, this information also may be useful for 
improvement of the design of clinical trials of therapeu- 
tic agents for increased-risk individuals. 
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Schizophrenia is a common disorder characterized by psychotic 
symptoms; diagnostic criteria have been established^ Family, 
twin and adoption studies suggest that both genetic and envi- 
ronmental factors influence susceptibility (heritabtlity is 
approximately 71%; ref. 2), however, little is known about the 
aetiology of schizophrenia. Clinical and family studies suggest 

0 aetiological heterogeneity^^. Previously, we reported that 
^ regions on chromosomes 22, 3 and 8 may be associated with 
3 susceptibility to schizophrenia^'^ and collaborations provided 
c some support for regions on chromosomes 8 and 22 (refs 9-1 3). 
S We present here a genome-wide scan for schizophrenia suscep- 
g tibility loci (SSL) using 452 microsatellite markers on 54 mufti* 

plex pedigrees. Non-parametric linkage (NPL) analysis provided 
^ significant evidence for an SSL on chromosome 13q32 (NPL 

ti scores 4.1 8; Pr: 0.00002), and suggestive evidence for another 

• SSL on chromosome 8p21-22 (NPL = 3.64; P= 0.0001). Parametric 

^ linkage analysis provided additional support for these SSL. 

Q Linkage evidence at chromosome 8 is weaker than that at chro- 

■c mosome 13, so it is more probable that chromosome 8 may be 

E a false positive linkage. Additional putative SSL were noted 

^ on chromosomes 14q13 (NPL=2.57; P=0.005), 7q11 (NPL=2.S0, 

1 P= 0.007) and 22q11 (NPL = 2.42, /»= 0.009). Verification of sug- 
^ gestive SSL on chromosomes 13q and 8p was attempted in a 
o follow-up sample of 51 multiplex pedigrees. This analysis con- 
5 finmed the SSL in 13q14^33 (NPL=2.36, P=0.007) and sup- 
Q> ported the SSL in 8p22-p21 (NPL=1.95, P=0.023). 

g A genome scan of 54 families was conducted using 452 mark- 

ers covering 22 autosomes, a distance of 3,451 centimorgans 
(cM), with an average resolution of 7.6 cM. We estimate the 
markers covered more than 90% of the genome when compared 
with an independent 3,792-cM map (A.C„ unpublished data). 
Using non-parametric analyses, the most significant allele shar- 
ing (linkage) between schizophrenics in families is on chromo- 
some 13q32 near marker D13S174 (NPL = 4J8, P=0.00002). 
followed by a locus on chromosome 8p21-22 near marker 
D8S1771 (NPL =3.64. P= 0.0001; Fig. 1 and Table 1). Addition- 
ally, chromosomes 7, 14 and 22 also show NPL scores of approxi- 
mately 2.5 and P-values of less than 0.01. Based on previous 
criteria*^, the linkage to chromosome 13q is significant, whereas 
that to chromosome 8p is suggestive. 

Parametric analyses were conducted to assess the likelihood of 
dominant and recessive models. Assuming linkage homogeneity, 
recessive and dominant models could be excluded for all chro- 



mosomes except chromosome 8p, with a peak dominant lod 
score of 3.19 near D8S177}, coincident with the non-parametric 
evidence. Allowing for locus heterogeneity, on chromosome 8p 
the peak dominant heterogeneity lod score is 4.54 (70% of fami- 
lies linked); the peak recessive heterogeneity lod score is 2.01 
(32% of families linked). On chromosome 13q. the peak domi- 
nant heterogeneity lod score is 1.84 (44% of families linked), and 
the peak recessive heterogeneity lod score is 3.19 (48% of families 
linked). For chromosomal regions 7qll, 14ql3and 22qll, which 
have peak NPL scores of 2.5. 2.57 and 2.42 at markers D7S2212, 
D14S306 and D22S1265, respectively, heterogeneity lod scores 
greater than 1 were also observed. Information content was 80- 
90% in regions with evidence of linkage. 

There are three noted caveats for interpreting the parametric 
analyses in our study: (i) models tested cannot be true for all seg- 
regating susceptibility loci; (ii) if multiple loci are involved. link- 



Table 1 • Parametric and non-parametric linkage analysis 





Nonparametric 


Parametric 






Chromosome 






Lod^ 


Lod 


HLOD*' 


HLOD 


Number 


NPL 


P-value 


Dom« 


Rec** 


Dom« 


Rec^ 


1 


1.39 


0.084 


<-2 


<-2 


0.18 


0.89 


2 


1.26 


0.104 


<-2 


<-2 


0.18 


0.48 


3 


1.59 


0.06 


<-2 


<-2 


1.10 


0.47 


4 


0.68 


0.247 


<-2 


<-2 


0.03 


0.001 


5 


1.5 


0.068 


<-2 


<-2 


0.46 


0.75 


6 


1.96 


0.027 


<-2 


<-2 


0.68 


0.77 


7 


2.5 


0.007 


<-2 


<-2 


1.30 


1.16 


8 


3.64 


0.0001 


3.19 


<-2 


4.54 


2.01 


9 


0.84 


0.2 


<-2 


<-2 


0.44 


0.09 


10 


1.87 


0.03 


<-2 


<-2 


0.87 


1.00 


11 


1.27 


0.102 


<-2 


<-2 


0.43 


0.67 


12 


1.01 


0.155 


0.12 


<-2 


1.34 


0.50 


13 


4.18 


0.00002 


<-2 


<-0.40 


1.84 


3.19 


14 


2.57 


0.005 


<-2 


<-1.61 


0.93 


0.44 


15 


0,9 


0.182 


<-2 


<-2 


0.43 


0.01 


16 


0.59 


0.276 


<-2 


<-2 


0.06 


0.08 


17 


1.9 


0.03 


<-2 


<-1.84 


0.41 


0.73 


18 


0.97 


0.166 


<-2 


<-2 


0.16 


0.47 


19 


1.62 


0.053 


<-2 


<-2 


0.78 


0.41 


20 


1.22 


0.112 


<-2 


<-2 


0.41 


0.01 


21 


1.08 


0.139 


<-2 


<-2 


0.34 


0.004 


22 


2.42 


0.009 


<-2 


<-2 


1.81 


2.10 



°Lod, lod score; "HLOD, heterogeneity lod score; ^Dom, assumed dominant 
mode of inheritance; ^'Rec, assumed recessive mode of inheritance. 
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Fig. 1 Genome-wide scan for schizophrenia susceptibility loci 
(SSLs). Each tx>x contains a figure where the x-axis represents the 
genetic distance In cent) morgans for one of the twenty-two auto- 
somes and the y-axis represents the multipoint non-parametric 
linkage (NPL) score (Z-all). 
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age heterogeneity must exist; and (iii) failure to detect 
linkage is a rejection of the model tested, rather than 
of linkage itself. First, non-parametric analysis 
demonstrates significant linkage on chromosome 13 
(P- 2.2x10'^) and suggestive linkage on chromo- 
some 8 (P= 1.9x10-'^), but the dominant model 
(assuming homogeneity) gives a lod score of 3.2 on 
chromosome 8 and neither model shows significant 
linkage on chromosome 13. Once linkage heterogene- 
ity is acknowledged, the dominant model has a lod 
score of 4.5 on chromosome 8 and the recessive model 
a lod score of 3.2 on chromosome 13. The two suscep- 
tibility loci show differing patterns of inheritance. 
Second, evidence of linkage from parametric analysis 
only appears upon assuming heterogeneity. The pro- 
portions of families demonstrating linkage to chro- * 
mosome 8 and 13 loci are 70% and 48%. respectively. 
The total fraction of families showing linkage is 1 1 8%. ' "^"^'"^ 
suggesting segregation of multiple factors in the same ' " ' 
family. Linkage evidence at chromosome 8 is weaker 
than that at chromosome 13, so chromosome 8 has a greater 
probability of being a false positive linkage than chromosome 13. 
Third, weak linkage on chromosome 22q, by both non-paramet- 
ric and parametric analyses, suggests that the models chosen 
were inappropriate for detecting linkage, and a small genetic con- 
tribution of this locus in these families. 

NPL and lod score analyses on chromosomes 8 and 13 (Fig. 2) 
show that peak evidence is coincident on each chromosome for 
both non-parametric and parametric analyses, and that peak 
location is unchanged when heterogeneity in linkage is assumed. 
Heterogeneity fits the data (chromosome 8. xf = 6.22, P<0.02; 
chromosome 13. xf = 16.53, P<0.001). suggesting that all 54 
families do not segregate the putative factors on chromosome 8 
and 13. A 95% confidence interval for the region size implicated 
by parametric linkage analysis is 18 cM for both chromosomes 8 
and 13, assuming the dominant and recessive model, respectively. 
Enlargement of the implicated region for chromosome 8p com- 
pared with our previous report^ may be explained by several fac- 
tors: (i) a 17-cM interval in the previous map that contained one 
marker {D8S136) has been made more informative by including 
genotypings of 13 additional polymorphisms, including the 
locus with the current peak evidence (D8S1771}: (ii) equal allele 
frequencies were used for all markers in previous reports, 
whereas exact allele frequencies are used currently; (iii) 1 1 fami- 
lies with little linkage information in the previous sample were 
replaced with eight potentially more informative families; and 
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(iv) multipoint analyses to assess relative pair sharing of alleles 
increases the amount of linkage information in an interval com- 
pared with prior use of two-point analyses. 

To add support, we analysed 51 additional families for 17 chro- 
mosome 8p22-p21 markers (32 cM) and seven chromosome 
13ql4-q33 markers (16 cM). For the 8p region, the peak NPL 
was 1.95 (P=0.023) at D8SI752, approximately 3 cM telomeric to 
the D8S1771 peak in the 54 families. The peak dominant hetero- 
geneity lod score was 0.68 also at D8S1752. For the 13q region, 
the peak NPL was 2.36 (P=0.007) at D13S779. approximately 7 
cM telomeric to the peak at D13S174 in the 54 families. The peak 
recessive heterogeneity lod score was 0.61 at DISS 1284, approxi- 
mately 5 cM centromeric to the original peak. For both regions, 
peaks in the NPL curves and in the heterogeneity lod score curves 
were observed in a flat region extending more than 5 cM. 

The difference in the degree of significance between primary 
and follow-up samples is probably due to random variation in 
the proportion of linked families at a given SSL. If the true pro- 
portion of linked families at a given SSL is 50%, then in a new 
sample of 50 pedigrees, the proportion of linked families would 
vary 36-64% with 95% probability. 

Our study differs from others in several respects. First, we used a 
narrow definition of the illness. This may have reduced our statisti- 
cal power: however, it also may reduce the false positive rate by 
decreasing the number of statistical tests performed. Second, most 
families in the primary sample were ascertained from probands in 



Fig. 2 Multipoint non- para metric and parametric 
analyses for chromosome 8 (a) and chromosome 13 
ib) from a genome-wide scan for SSLs. The x-axIs 
represents the genetic distance along the chro- 
mosome in centi morgans and the y-axis is used to 
depict the parametric lod score (blue) and the het- 
erogeneity lod score (red), and non -para metric link- 
age (NPL) score or Z-ail (black). For chromosome 8, 
parametric lod and heterogeneity lod score were 
obtained using the dominant model. For chromo- 
some 13, parametric lod and heterogeneity lod 
score were obtained using the recessive model. 
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Table 2 • Analysis of 10 SZA subjects 





Nonparametric 


Parametric 




Chromosome number 


NPL 


P- value 


Lod« 


HLOD^ 


8p: 10 SZA affected 


3.50 


0.0001 


2.16(Dom*=) 


4.14 (Dom) 


10 SZA unknown 


3.56 


0.0001 


3.27 (Dom) 


4.54 (Dom) 


13q:10SZA affected 


4.18 


0.00002 


<-0.40 (Rec^) 


3.19 (Rec) 


10 SZA unknown 


3.25 


0.0004 


<-2.50 (Rec) 


1.93 (Rec) 



»Lod, lod score; *'HLOD, heterogeneity lod score; 'Dom, assumed dominant 
mode of inheritance; **Rec, assumed recessive mode of inheritance. 



a systematic sample; affected avuncular pairs were included in 
addition to affected sibpairs. Third, longitudinal as well as cross- 
sectional clinical data about each subject from multiple sources 
was obtained. Longitudinal information helped in differentiation 
of the psychotic disorders. Fourth, marker maps used loci for 
which location had high statistical support, because multipoint 
disease mapping is sensitive to local order and map distances. 

In the process of completing the genome scan, linkage reported 
by others on 6p, 15q and 5q (refs 15-18) was followed up in our 
sample We failed to detect linkage in these additional 
regions. Modest evidence for linkage on chromosome 13ql4- 
q32 has been reported^^"^^. 

To test for linkage heterogeneity, we classified the 54 families in 
the primary sample according to the expression of psychotic affec- 
tive disorders or schizophrenic spectrum personality disorders 
(for example, schizoid, schizotypal and paranoid personality dis- 
orders) in their relatives. Six families had 17 relatives with psy- 
chotic affective disorders and eight families had 13 relatives with a 
schizophrenia spectrum personality disorder. Multipoint NPL 
analyses suggested that families segregating schizophrenia spec- 
trum disorders contributed to linkage findings on both 13q32 
(P=0,03) and 8p21 (P=^0.00001). There was a negligible impact 
(P-O.IS) of families segregating psychotic affective disorders on 
the chromosome 8 findings, but this group contributed chromo- 
some 13 findings (P=0.02). The latter data are consistent with 
evidence for linkage between bipolar illness in multiplex families 
and markers from chromosome 13q32 (S.D. Detera-Wadleigh, 
pers. comm.: J. Kelsoe, pers. comm.). Subtyping was not carried 
out for the follow-up sample. The number of individuals initially 
seen in the follow-up sample Included only those possibly affected 
and their parents. 

To address possible misclassification in the diagnosis of 
schizoaffective disorder (SZA), we reanalysed the genome scan 
data on chromosomes 8 and 13, treating the 10 SZA subjects (in 
nine families) as phenotype unknown. The NPL, P-value. lod 
score and heterogeneity lod score (Table 2) shows that for chro- 
mosome 8. non-parametric evidence remains unchanged; how- 
ever, the increase in parametric evidence suggests that SZA 
subjects do not contribute to the chromosome 8 evidence. Treat- 
ing 10 SZA subjects as unknown in the reanalysis of chromosome 
13 reduced both parametric and non-parametric evidence, sug- 
gesting SZA classification contributed to the genome scan find- 
ings for chromosome 13q. These data are consistent with our 
heterogeneity subtyping analyses. 

To better address the multigenic hypothesis, we computed the 
correlation coefficient between the NPL scores at the peak loca- 
tions in the chromosome 8 and 13 SSL for 54 families. The NPL 
score is independent of any genetic model assumed. In a model of 
linkage heterogeneity, where some families are linked to the chro- 
mosome 8 SSL and others to the chromosome 13 SSL. some fam- 
ilies by chance may show segregation at both loci, although the 
expected correlation must be negative. The observed correlation 
of 0.12 is positive but not significantly different from zero 



(P>0,05). We also classified the 54 families based on their NPL 
scores at chromosome 13 (the SSL with the highest linkage evi- 
dence). Thirty-six families had positive chromosome 13 NPL 
scores (group 13+) and 18 families had zero or negative NPL 
scores (group 13-). The peak chromosome 8 NPL score, P-value, 
lod score and heterogeneity lod score for group 13+ was 3.0, 
0.0015, 2,3 and 3.2, and for groupl3- was 2. 1, 0.0263. 1.7 and 1.9. 
Given that group 13- is one- half the sample size of group 13+ and 
is expected to have lower information content, these data show 
that the chromosome 8 SSL evidence arises from both groups. 
The greater chromosome 8 SSL evidence arises in families likely 
to be segregating the chromosome 13 SSL. This evidence favors 
the hypothesis of multigenic inheritance. 

The identification of chromosomal regions on 13q32 and 8p21 
that are linked to schizophrenia susceptibility is a step towards 
identifying the genetic aetiology of the disease. The target regions 
need to be reduced in size by studying additional famiUes, recog- 
nizing subtypes that show greater segregation with one or more 
SSL and saturating the regions with more markers for association 
and disequilibrium studies. 

Methods 

Pedigree ascertainment. We identified multiplex families through a sys- 
tematic sample of schizophrenic patients in the Maryland epidemiologic 
sample (MES; 44 families of European descent). Details of the MES are 
avallable^^'^^. Addldonal families were identified through nationwide 
recrulUnent efforts in the United States (31 families of European descent. 
6 African- American, 5 Ashkenazlm, 1 Amlsh) and international collabora- 
tions (10 Italian, 3 Polish, 5 Greek). Fifty-seven families were used in our 
previous report of potential linkage to chromosomes 3 and 8 (ref. 8). The 
current genome scan sample of 54 families was created by omitting 1 1 fam- 
ilies in which the only affecteds were a parent and a child and adding eight 
affected slbpalr or slb-tripIet families. 

The genome scan and follow-up samples have a similar number of 
affecteds per pedigree (2.7 versus 2.6) and the same sex distribution among 
affecteds (three males for every two females). The two samples differed in 
number of genotyped individuals (363 (80%) versus 289 (56%)), age at 
onset (>35 yr, 15% versus 3%), proportion of families with affecteds in 
more than one generation (50% v^ersus 22%) and the inclusion of biUneal 
families (0% versus 4%). These differences may be caused by the majority 
of families in the follow-up sample being ascertained through recruitment 
efforts focusing on IdenUfylng families with two or more affected siblings. 
In contrast, the Initial genome scan sample was obtained through a system- 
atic sample of families In which the proband and at least one other first or 
second degree relative was affected. 

Clinical assessment methods. An examiner (psychiatrist or psychologist) 
evaluated the subject's medical and psychiatric history by administration of a 
semi-structured Instrument Personality characteristics were rated by admin- 
istration of another semi-structured instrument. All interviews were audio- 
taped and in addition to the direct Interview of each subject, informants were 
also interviewed. Diagnoses were made according to DSM-IV criteria. 

Final diagnoses and age at onset were assigned through a consensus pro- 
cedure^. If there were any disagreement, the two diagnosticians met to 
resolve any differences. In a small percentage of cases, a third diagnostician 
was asked to review a case. It was a rare occasion that consensus could not be 
reached (less than 1% of cases), requiring an Individual to be removed from 
the analysis. We limited our analyses to a narrow and conservative definition 
of affected (that is. schizophrenia or schizoaffective disorders). Eight per- 
cent of the affecteds in 105 families (22 of 276 affecteds) were diagnosed as 
schizoaffective. Schizoaffectlves did not differ from schizophrenics when 
comparing age at onset, lineal Ity, course or gender. Each multiplex family 
had at least one schizophrenic and one other first or second degree relative 
diagnosed with schizophrenia or schizoaffective disorder. Individuals who 
were given a diagnosis of another psychoUc disorder or a schizophrenia 
spectrum personality disorder were classified as unknown In all analyses. 

Markers and maps. In this study we used 452 polymorphic microsatelHte 
markers with average heterozygosity of 76% (range 55-94%) . The majority 
of these markers were dlnucleotide repeat polymorphisms, the remainder 
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being tri- and tetranucleotide repeats. Information regarding the oligonu- 
cleotide primers used for genotyping. heterozygosity, allele sizes and fre- 
quencies were obtained from databases (http://www.chlc.org. http://www. 
genethon.fr. http://www.marshmed.org) . 

No single map that incorporated all loci was available because the genetic 
markers used arose from multiple sources. Further, accurate map distances 
between adjacent markers are needed for multipoint linkage analysis due to 
its sensitivity to false double crossovers arising from genotyping errors^*"^^. 
Consequently, we reconstructed linkage maps of all human chromosomes 
using the markers we had chosen^^ based on their published genotype data 
in the CEPH reference pedigrees (http://wwwxeph.fr). The de novo human 
genome genetic linkage map was constructed using MultlMap software^^. 
Maps were constructed with odds of support of 1000:1 or greater for mark- 
ers ranked by heterozygosity. All markers that did not map into a single 
1000:1 interval were localized to the most likely interval with 100:1 odds. 
Further details of the algorithms used for map construction are avallable^^. 
Details concerning the map are available upon request. 

Genotypes of polymorphic markers. Genotypes at 452 polymorphic 
markers were obtained from 363 individuals in the genome scan and at 24 
markers from 289 individuals in the follow-up sample. CEPH individuals 
1333-01 and 1333-02 were genotyped as controls for allele size. To aid in 
map construction, eight CEPH families were genotyped for a few markers. 
For genotyping, all markers were assayed by PGR and polyacryiamide gel 
electrophoresis. Initially, all alleles were coded within each family. In 
regions suggestive of linkage on chromosomes 8,13 and 22 with non-para- 
metric P- value less than 0.01, marker alleles were recoded across families 
by allele size. For these latter markers, allele frequencies were estimated 
from 106 unrelated individuals in the genome-scan sample and from 82 
unrelated individuals in the follow-up sample. Laboratory procedures were 
repeated if necessary to minimize missing genotypes. On average, less than 
0.025% of all genotypes for any one marker were missing. 

Linkage analysis and genome scanning. We performed parametric and 
non-parametric multipoint linkage analyses using GENEHUNTER version 
l.l (ref. 29). GENEHUNTER estimates the statistical significance of shar- 
ing alleles Identical- by-descent between all affected individuals, as well as 
how much of the total genetic information in a segment has been extracted 
from the markers studied. 

We initially screened the whole genome and performed non- parametric 
linkage analysis. All marker loci were assumed to have equal allele frequen- 



cies, and analysis was based on the map we constructed. Muklpoint linkage 
analysis used GENEHUNTER providing the polntwise P-value across each 
chromosome. These calculations used the non-parametric Z-all statistic, 
which computes the joint Identity-by-descent probability among all 
affecteds in a family. The P-values presented were obtained from GENE- 
HUNTER and were derived by computing the exact probability distribu- 
tion of the overall Z-all score under the null hypothesis of no linkage. A 
normal approximation was not used. For regions of chromosomes 8. 13 
and 22 with P- values less than 0.01, we genotyped additional markers and 
used the estimated allele frequencies to repeat the analysis. Finally, we also 
conducted parametric analysis under the assumptions of both homogene- 
ity and linkage heterogeneity. The disease inheritance model used assumed 
a population prevalence of approximately 1% with approximately 10% 
phenocopy rate and Hardy-Weinberg equilibrium. For the dominant 
model this corresponded to penetrances of 0.65, 0.65 and 0.0096 for the 
mutant homozygote. mutant heterozygote and wild-type homozygote, 
respectively, and a mutant allele frequency of 0.005; for the recessive model 
this corresponded to penetrances of 0.65. 0.0096 and 0.0096 for the mutant 
homozygote, mutant heterozygote and wild-type homozygote, respectively, 
and a mutant allele frequency of 0.11. Confidence intervals for the size of 
the regions Implicated by linkage were based on the results of parametric 
analysis and the most likely model using the l-unlt-down method^. In this 
method, all map locations which give a score above Z-1 are considered. 
Z being the peak lod score. 

For both types of analyses, we set the genome-wide false positive rate at 
5% and used established criteria '1 Linkage evidence at a single point In the 
genome is considered significant whenever the P-value is 2.2x10-^ or 
smaller and/or the non-parametric lod score is 3.6 or larger. The evidence 
is considered suggesUve whenever the P-value is between 2.2 x 10*5 and 
7.4 X 10"^ and/or the non -parametric lod score is 2.2-3.6. 
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Abstract Previously we reported suggestive evidence for 
linkage of schizophrenia to markers on chromosome 
13ql4.1-q32. We have now studied an additional inde- 
pendent sample of 44 pedigrees consisting of 34 Tai- 
wanese, 9 English and 1 Welsh family in an attempt to 
replicate this finding. Narrow and broad models based on 
Research Diagnostic Criteria or the Diagnostic and Statis- 
tical Manual of Mental Disorders, third edition, revised, 
were used to define the schizophrenia phenotype. Under a 
dominant genetic model, two-point lod scores obtained 
for most of the markers were negative except that marker 
D13S122 gave a total lod score of 1.06 (0= 0.2, broad 
model). As combining pedigrees from different ethnic ori- 
gins may be inappropriate, we combined this replication 
sample and our original sample, and then divided the total 
sample into Caucasian (English and Welsh pedigrees) and 
Oriental (Taiwanese and Japanese pedigrees) groups. The 
Caucasian pedigrees produced maximized admixture two- 
point lod scores (A-Iod) of 1.41 for the marker D13S119 
(0= 0.2, a= 1.0) and 1,54 for D13S128 (0= 0, a= 0.3) 
with nearby markers also producing positive A- lod 
scores. When five-point model-free linkage analysis was 
applied to the Caucasian sample, a maximum lod score 
of 2.58 was obtained around the markers D13S122 and 
D13S128, which are located on chromosome 13q32. The 
linkage results for the Oriental group were less positive 
than the Caucasian group. Our results again suggest that 
there is a potential susceptibility locus for schizophrenia 
on chromosome 13ql4.1-q32, especially in the Caucasian 
population. 
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Schizophrenia is a serious mental illness that may be 
caused by abnormal brain development (Jones and Mur- 
ray 1991). The lifetime morbid risk worldwide ranges 
from 0.5 to 1% (Gottesman 1991). The aetiology of schizo- 
phrenia is unclear, but family, twin and adoption studies 
have shown that genetic factors account for approxi- 
mately 70% of the variance in liability (Gottesman 1991). 
Nevertheless, the mode of inheritance of the disorder re- 
mains unclear. Although recent statistical modelling sug- 
gests at least three common genes that act multiplicatively 
on the risk of illness (Risch 1990), the existence of major 
genes in some families remains a possibility. Linkage 
studies have identified several "hot-spots" for schizophre- 
nia: chromosome 22 (Pulver et al. 1994a, b; Lasseter et al. 
1995; Vallada et al. 1995; Schizophrenia Collaborative 
Linkage Group 1996). chromosome 3p (Pulver et al. 
1995), chromosome 8p (Pulver et al. 1995) and chromo- 
some 6 (Moises et al. 1995; Schwab et al. 1995; Straub et 
al. 1995; Wang et al, 1995), 

Previously we reported suggestive evidence for link- 
age of schizophrenia to markers on chromosome 13ql4. 1- 
q32 (Lin et al, 1995), and recently two other groups have 
found suggestive evidence for this region (Antonarakis et 
al. 1996; Kalsi et al, 1996), In order to follow up this pro- 
visional finding, an additional independent sample of 44 
pedigrees (9 English, 1 Welsh and 34 Taiwanese) were 
collected. The British small -to-moderate size families 
were ascertained by referral of patients with schizophre- 
nia who had at least one known, living, first-degree rela- 
tive affected with the same illness. The Taiwanese nuclear 
families were recruited by referral of patients with schizo- 
phrenia who had at least one known, living sibling af- 
fected with the same disease. These studies were ap- 
proved by the appropriate hospital ethics committees and 
informed consent was obtained from all study subjects. 
Diagnoses are based on Research Diagnostic Criteria 
(RDC); (Spitzer et al. 1978) for the Welsh and English 
pedigrees and the Diagnostic and Statistical Manual of 
Mental Disorders, third edition, revised (DSM-III-R); 
(American Psychiatric Association 1987) for Taiwanese 
families. A total of 224 individuals including 93 with 
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Table 1 Two-point linkage results of admixture (A-Lod) and model-free lod (MFLOD) scores of schizophrenia for markers on chro- 
mosonte 13 with the narrow model f or replication sample (n = 44), Caucasian group (n = 21) and Oriental group {n = 36) 

Marker 



Replication sample 



Caucasian group 



Oriental group 





A-Lod 






MFLOD 
Lod score 


A-Lod 






MFLOD 
Lod score 


A-Lod 






MFT on 
Led score 


A-Iod 
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A-lod 


0 
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A-lod 
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D13S126 


0,58 


0 


0.45 


0.024 
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u.uuo 


0.79 
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0.55 


0.04 


HTR2A 


0.02 


0 


0.15 
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0 


0.30 


D13S119 


0.02 
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0.05 


0 


1.41 


0.2 


1.0 


1.19 


0.14 


0 


0.20 


0 


D13S144 
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0 


0 


1.16 


0 


0.35 


1.18 


0 


0 


0 


0 


D13S160 


0.34 
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0,30 


0.72 


0.86 


0 


0.35 


0.72 


0.11 


0 


0.15 


0,51 
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0.24 
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0.30 


1.72 
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0.05 


0.28 


D13S64 
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0.05 
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1.03 
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0.75 
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0.18 
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0 
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0.90 


0.09 


0.12 


0 


0.15 


0 



schizophrenia and 17 with other psychiatric disorders 
were studied. Ten highly polymorphic microsatellite 
markers and the biallelic HTR2A marker (Warren et al. 
1993), which showed positive lod scores from our previ- 
ous findings, were genotyped using standard polymerase 
chain reaction (PGR) techniques (Saiki et al, 1985). Geno- 
typing data was first used to construct a genetic map using 
the program CRI-MAP (Lander and Green 1987). This 
enabled potential errors appearing as double recombina- 
tion events in a small genetic distance to be identified and 
checked. Several rounds of checking and map building 
were followed until a consistent map was obtained. 

Two diagnostic models based on RDC or DSM-III-R 
were used to define the disease phenotype. A narrow 
model consisted of RDC or DSM-III-R schizophrenia as 
affected and a broad model included diagnoses of schizo- 
phrenia, schizoaffective disorder and unspecified func- 
tional psychosis. Two-point lod score analyses were per- 
formed using the program MLINK from the LINKAGE 
package (Lathrop et al. 1984) and tabulated family by 
family. As locus heterogeneity was expected, the admix- 
ture lod score (A-lod), which assumes that a proportion a 
of families are linked, was calculated for each marker. Al- 
lelic frequencies were calculated from 79 unrelated indi- 
viduals in the pedigrees. A dominant model was used with 
a disease gene frequency (q) of 0.008, with penetrance (/"i, 
fj) of 0.5 and sporadic risk ifo) of 0.005 for the narrow 
model and 0.7 and 0.01 for the broad model. When the 
narrow model was used, subjects who would have been 
counted as unaffected under the broad model were desig- 
nated as phenotype unknown. In this replication sample, 
two-point total lod scores obtained for most of the mark- 
ers were negative at a variety of recombination fractions 
with both diagnostic models except for the marker D13S 122, 
which produced a lod score of 1.06 (0 = 0.2, with the 
broad model). Because of the uncertainty of the mode of 
transmission of the disease, the model-free method of 
linkage analysis, MFLINK (Curtis and Sham 1995), was 
also applied to our data. Two-point admixture and model- 



free lod (MFLOD) scores for markers on chromosome 1 3 
with the narrow model for the replication sample are 
shown in Table 1 . 

As genes predisposing to a disease may differ between 
ethnic groups, combining samples from different ethnic 
origins may be inappropriate. Therefore we divided this 
replication sample and our original sample into Caucasian 
(English and Welsh) and Oriental (Taiwanese and Japan- 
ese) groups. The 21 Caiicasian families used in this group 
consisted of 171 individuals, including 61 with schizo- 
phrenia and 17 with other psychiatric disorders. The 36 
Oriental families were composed of 193 individuals, in- 
cluding 85 with schizophrenia and 9 with other psychi- 
atric disorders. 

The Caucasian pedigrees gave total two-point lod 
score of 1.41 and 1.47 for markers DI3S119 and 
D13S128 at ^= 0,2 with the narrow model, respectively. 
Allowing for heterogeneity, the Caucasian pedigrees pro- 
duced maximized A-lod scores of 1.41 for marker 
D13S119 (0= 0.2, 1.0), 1.16 for D13S144 (^= 0, a= 
0.35), 1.19 for D13S122 (0= 0, a= 0.45), and 1,54 for 
D13S128 (^= 0, £ir= 0.3) with nearby markers also pro- 
ducing positive A-lod scores under the narrow model. A 
two-point model-free lod score of 1.72 was also obtained 
for D13S128. When we looked at family-by-family link- 
age results, the majority of the lod score for D13S128 was 
contributed by two families (lod score = 1.59 for 
CAR017, lod score =1.41 for INS035). The results of 
two-point admixture and model-free lod scores of schizo- 
phrenia for markers on chromosome 13 for the Caucasian 
sample with the narrow model are shown in Table 1. 

When five-point model-free linkage analysis was ap- 
plied to the Caucasian sample, a lod score of 2.58 (narrow 
model, as shown in Fig. 1) was obtained around markers 
D13S122 and DOS 128, which are located on chromo- 
some 13q32. A small peak of lod score of 1.09 was also 
observed near markers HTR2A and D13S119. Twelve- 
point non-parametric linkage (NPL) analysis using the 
program GENEHUNTER (Kruglyak et al. 1996) was per- 
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Fig. 1 Multipoint linkage results of schizophrenia for markers on 
chromosome 13 with narrow model for the Caucasian pedigrees (n = 
21) and the Oriental pedigrees (n = 36). NPL are 12-poinl non- 
parametric linkage analysis using the program GENEHUNTER. 
NPLC indicates led scores for the Caucasian group and NPLO for 
the Oriental group. MFLOD are 5-point model-fipee lod scores us- 
ing the program MFLINK coupled with VTTESSE. MFLODC 
indicates model-free lod scores for the Caucasian group and 
MFLODO for the Oriental group 

formed. For ease of comparison, the NPL statistics and P 
value produced by GENEHUNTER were transformed to 
lod score equivalents. A maximum NPL lod score of L54 
was obtained on the same region as the five-point 
MFLOD results for the Caucasian group (Fig. 1). The in- 
heritance information extracted from the 1 1 markers used 
for the multipoint NPL analysis ranges from 0.74 to 0.91. 
Although the 12-point non-parametric lod scores obtained 
were less positive than the 5-point MFLOD scores, the pat- 
tern of the NPL results is similar to the MFLOD results. 

The linkage results for the Oriental group were less 
positive than for the Caucasian group. Under the assump- 
tion of heterogeneity, a maximum two-point lod score of 
0.79 for the marker D13S126 (narrow model) was obtained 
(Table 1). In the Oriental group, a region around markers 
D13S160 and D13S121 gave a five-point MFLOD score 
of 0.43 (Fig. 1). Twelve-point NPL results for the Oriental 
group shown in Fig. 1 do not suggest linkage. 

The lod scores obtained from the replication sample 
are less impressive than our original findings (an approx- 
imate multipoint lod score of 2 was the maximum ob- 
tained) with both diagnostic models. However, when the 
combined sample is divided into different ethnic groups, 
the results from the Caucasian group, which included ten 
additional families that are not part of our original sample, 
are consistent with our previous findings. This was not the 
case for the Oriental group. We may speculate therefore 
that the susceptibility locus for schizophrenia in Cau- 
casians may be different from that in Orientals. 

The lod score we obtained is below the traditional crit- 
ical value of 3 for linkage; however, it has been pointed 
out that some true susceptibility loci may never show sig- 



nificant linkage because they confer a very small in- 
creased risk and have common alleles (Owen and Crad- 
dock 1996). Two other groups have also found suggestive 
evidence in this region: Antonarakis et al. (1996) obtained 
a lod score of 2.54 for D13S128 using a dominant model 
and Kalsi et al (1996) a lod score of 1.09 for DISS 144, It 
remains possible that there is a dominant susceptibility lo- 
cus for schizophrenia in a proportion of families, espe- 
cially for subjects of Caucasian ethnicity. An international 
collaboration and meta-analysis may be required to eluci- 
date this suggestive linkage evidence to markers on the 
chromosome 1 3q 14. l-q32 region. 
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Summary 

Over the past 4 years, a number of investigators have 
reported findings suggestive of Hnkage to schizophrenia, 
with markers on chromosomes 13q32 and 8p21, with 
one recent study by Blouin et aL reporting significant 
linkage to these regions. As part of an ongoing genome 
scan, we evaluated microsateUite markers spanning 
chromosomes 8 and 13, for linkage to schizophrenia, in 
21 extended Canadian famiHes. Families were analyzed 
under autosomal dominant and recessive models, with 
broad and narrow definitions of schizophrenia. All mod- 
els produced positive LCD scores with markers on 13q, 
with higher scores under the recessive models. The max- 
imum three-point LCD scores were obtained under the 
recessive-broad model: 3.92 at recombination fraction 
{$) .1 with D13S793, under homogeneity, and 4,42 with 
a= .65 and ^ = 0 with D13S793, under heterogeneity. 
Positive LCD scores were also obtained, under all mod- 
els, for markers on Bp. Although a maximum two-point 
LCD score of 3.49 was obtained under the dominant- 
narrow model with D8S136 at ^ = 0.1, multipoint anal- 
ysis with closely flanking markers reduced the maximum 
LCD score in this region to 2.13. These results provide 
independent significant evidence of linkage of a schizo- 
phrenia-susceptibility locus to markers on 13q32 and 
support the presence of a second susceptibility locus on 
8p21. 



Introduction 

Schizophrenia is a serious neuropsychiatric illness af- 
fecting —1 % of the general population. Family, twin, and 
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adoption studies have demonstrated that schizophrenia 
is predominantly genetically determined and has high 
heritability (McGuffin et al. 1994). No gene for schizo- 
phrenia has yet been identified, and no significant link- 
age finding has yet been reproduced in an independent 
sample (Moldin 1997). Two regions that have recently 
yielded reports of significant linkage to schizophrenia 
are chromosomes 13q32 and 8p21 (Blouin et al. 1998). 
Several groups had previously reported positive, but not 
significant, findings of linkage to markers on chromo- 
somes 13q (Lin et al. 1995; Pulver et al. 1996; Lin et 
al. 1997; Straub et al. 1997; Shaw et al. 1998) and 8p 
(Moises et al. 1995; Pulver et al. 1995; Kendler et al. 
1996; Schizophrenia Linkage Collaborative Group for 
Chromosomes 3, 6, and 8 1996; Kaufmann et al. 1998; 
Shaw et al. 1998; Wildenauer and Schwab 1998). 

Table 1 summarizes positive linkage findings for chro- 
mosomes 13q and 8p. The chromosome 13 linkage re- 
sults cluster within a region of ~12 cM in 13q32, except 
for the findings of Shaw et al. (1998), which are located 
within two regions, '-'15 cM centromeric on 13q31 (the 
findings presented in Table 1) and ^50 cM centromeric 
on 13ql2-ql3. For all of the chromosome 13 studies, 
results were obtained by use of narrow definitions of 
affected phenotype: schizophrenia only or schizophrenia 
and schizoaffective disorder. Studies that also evaluated 
evidence for linkage under broader phenotypic models 
found, in all cases, that a narrowly defined phenotype 
produced the strongest evidence for linkage to 13q (Lin 
et al. 1997; Straub et al. 1997; Shaw et al. 1998). Results 
were not as consistent with respect to recessive and dom- 
inant models of inheritance; some studies favored a dom- 
inant model, and others a recessive model (table 1). 
Maximum LOD scores under both inheritance models 
were not routinely reported, however, making intrastudy 
comparisons difficult. Interestingly, in the case of two 
studies using the same set of 54 families but different 
parametric-linkage methods and marker sets, Pulver et 
al. (1996) found greater support for linkage to 13q32 
under the dominant model of inheritance, whereas 
Blouin et al, (1998) found greater support under the 
recessive model of inheritance. 

The chromosome 8p linkage results shown in table 1 
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Table 1 

Previous Significant and Suggestive Linkage Findings of Schizophrenia to Chromosomes 
13q and 8p 



Model-Free 
Analyses 



Parametric Analyses 



Dominant 



Recessive 



No. OF 

Families 



Reference 



Chromosome 13q: 














P .00002* 


hLOD 


= L84- 


hLOD 


= 3.19- 


54" 


Blouin et al. (1998) 


P = .0002- 


hLOD 


= 3.24 


hLOD 


= 2.53 


54'' 


Pulver et al. (1996) 


LOD = 2.58" 


hLOD 


= 1.54- 






21 


Lin et al. (1997) 


P = .02- 






hLOD 


= 1.36- 


275 


Straub et al. (1997) 


P = .03- 


hLOD 


= 1.25 






70 


Shaw et al. (1998) 


Chromosome 8p: 












p = .ooor 


hLOD 


= 4.54" 


hLOD 


= 2.01- 


54" 


Blouin et al. (1998) 


P = .00004 


LOD = 


2.35 


LOD^ 


2.20 


57" 


Pulver et al. (1995) 


P = .006- 


hLOD 


= 2.34" 


hLOD 


= 2.52- 


265 


Kendler et al. (1996) 


P = .005- 


hLOD 


= .99 


hLOD 


= 2.22 


463"= 


SLCG (1996) 


P = .07- 






LOD = 


1.99 


70 


Shaw et al. (1998) 


P = .013=" 










30 


Kaufmann et al. (1998) 


P = .04 










5 


Moises et al. (1995) 



Note.— P = nominal P values; LOD = maximum homogeneity LOD score; and hLOD = 
maximum heterogeneity LOD score. 
- Multipoint analysis result. 

" The same 54 families were studied by Pulver et al. (1996) and Blouin et al. (1998); 46 
of these families overlapped with those studied by Pulver et al. (1995). 
" No known overlap with samples from other studies referenced. 



cluster within a region of -18 cM, centered around 
8p21, except in one study, in which positive results ex- 
tended over a 50-cM region (Kaufmann et al 1998). A 
recent meeting report summarized additional evidence 
in support of an 8p21 schizophrenia locus (Wildenauer 
and Schwab 1998), including a LOD score of 2.16, 
found by Curling and colleagues, under a dominant 
model of inheritance, using a set of 23 families from 
Iceland and the United Kingdom, and an increase in the 
maximum LOD score to 3.16, in the sample previously 
reported by Kendler et al. (1996). Most of the chro- 
mosome 8p studies reported analyses using a single, nar- 
row definition of schizophrenia (Moises et al. 1995; Pul- 
ver et al. 1995; Schizophrenia Linkage Collaborative 
Croup for Chromosomes 3, 6, and 8 1996; Blouin et 
al. 1998; Kaufmann et al. 1998). Of two studies that 
considered broader phenotypic definitions, one reported 
maximum LOD scores under a very broad diagnostic 
definition, with significant decreases in scores under nar- 
rower definitions (Kendler et al. 1996), and another 
found the strongest evidence for linkage on chromosome 
8p using a very narrow definition of disease (Shaw et 
al. 1998). As with the chromosome 13 findings, positive 
linkage results on 8p were seen under both dominant 
and recessive models of inheritance (table 1). Among 
studies reporting LOD scores for both dominant and 
recessive models, two studies, using overlapping sam- 
ples, reported greater support for linkage under the dom- 
inant model (Pulver et al. 1995; Blouin et al. 1998). In 
contrast, two other studies reported higher maximum 



LOD scores under recessive models of inheritance (Ken- 
dler et al. 1996; Schizophrenia Linkage Collaborative 
Croup for Chromosomes 3, 6, and 8 1996). 

Despite the differences in the samples used, the range 
of diagnostic and genetic models tested, and the variety 
of analytical approaches taken, the multiple findings of 
positive linkage support the possibility of schizophrenia- 
susceptibility genes located in relatively small regions of 
chromosomes 8p and 13q. Given the history of unre- 
plicated findings in the field of psychiatric genetics (Mol- 
din 1997), however, other significant linkage findings in 
studies using an independent sample of families would 
help establish these chromosomal regions as more de- 
finitively linked to schizophrenia. 

As part of an ongoing genome scan, the present study 
evaluated the evidence for linkage of schizophrenia to 
chromosomes 8 and 13 in a sample of 21 extended Ca- 
nadian families. Parametric-linkage analyses were con- 
ducted under both autosomal dominant and recessive 
models, using two diagnostic classifications, narrow and 
broad. We found significant linkage of schizophrenia to 
markers on 13q32, providing independent replication of 
significant linkage results for this chromosomal region. 

Subjects and Methods 

Subjects 

Canadian families of Celtic {n - 20) or German (?i = 
1) descent were recruited for the linkage study if schiz- 
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Table 2 

Maximum Pairwise LOD Scores for Schizophrenia and Chromosome 13 Markers 

Model 



Recessive-Narrow Recessive-Broad Dominant-Narrow Dominant-Broad 
Marker Hetero- 



Locus 


Position* 


zygosity'' 


LOD (HLOD') 




LOD (hLOD) 




LOD (hLOD) 


e{a) 


LOD (hLOD) 




D13S787 


0 


.80 


0(0) 


.4 (.75) 


0 


.5 


.53 


.3 


.28 


,4 


D13S1493 


17 


.75 


0(0) 


.5 (.05) 


0 (.07) 


.4 (.20) 


.76 


.2 


O(.ll) 


.5 (.10) 


D13S894 


24 


.68 


0 


.5 


.02 (.03) 


.4 (.50) 


.09 


.4 


0 


.5 


D13S325 


30 


.79 


0 


.5 


0 


.5 


0 


.5 


0 


.5 


D13S788 


37 


.85 


0 


.5 


0 


.5 


O(.Ol) 


.4 (.65) 


0 


.5 


D13S800 


47 


.74 


.65 (.65) 


.2 {.95) 


.06 


.4 


.74 (.96) 


.2 (.30) 


.21 (.46) 


.3 (.25) 


D13S317 


56 


.78 


.49 


.3 


0 


.5 


1.03 (1.11) 


.2 (.80) 


0 


.5 


D13S793 


68 


.77 


2.09 (2.10) 


.1 (.95) 


1.71 (1.93) 


.1 (.55) 


1.11 (1.52) 


.2 (.40) 


.07 (.08) 


.4 (.35) 


D13S779 


75 


.65 


.77 


.2 


1.78 (2.20) 


.2 (.50) 


.82 (1.09) 


.2 (.45) 


0 (.75) 


.5 (.20) 


D13S796 


86 


.81 


.31 (.83) 


.3 (.25) 


1.60 (1.84) 


.2 (.50) 


.45 


.3 


.08 (.74) 


.4 (.10) 


D13S285 


103 


.80 


0 


.5 


0 


.5 


0 


.5 


0 


.5 



Note. — LOD = maximum homogeneity LOD score; hLOD = maximum heterogeneity LOD score. 
" In Kosambi centimorgans. 

Determined from 30 unrelated family members. 
' For HOMOG analyses with a< 1. 



ophrenic illness appeared to be segregating in a unilineal 
autosomal dominant manner (Bassett et al. 1993; Bassett 
and Honer 1994). Twenty-one moderately large families 
(n = 285 subjects) were assessed, and 276 subjects had 
DNA samples available for the current study. All subjects 
were enrolled in this study after informed consent was 
obtained, and all procedures conformed to human-sub- 
jects protocols approved by the University of Toronto 
and Rutgers University. The average family size was 22 
individuals, with an average of 13 individuals per family 
participating. Most families consisted of two or three 
generations with only first- and second-degree relatives 
participating, although some large complex families con- 
tained relatives as distant as ninth degree. Direct inter- 
views — the Structured Clinical Interview for DSM-III-R 
(SCID-I) for major disorders and the SCID-II for per- 
sonality disorders — along with collateral information 
and medical records were used to make consensus di- 
agnoses on the basis of DSM III-R criteria. Further de- 
tails of the diagnostic and ascertainment methods have 
been described elsewhere (Bassett et al. 1993; Bassett 
and Honer 1994). The diagnostic classifications narrow 
and broad were used, with 71 and 107 affected indi- 
viduals in each category, respectively. Individuals were 
considered affected under the narrow diagnostic classi- 
fication if they were diagnosed with schizophrenia 
(n - 54) or chronic schizoaffective disorder (n - 17). In- 
dividuals were considered affected under the broad di- 
agnostic classification if they had been diagnosed with 
one of those disorders or with a nonaffective psychotic 
disorder (w = 13), schizotypal personality disorder (n = 
17), or paranoid personality disorder (w = 6). Diagnoses 
were available on all 285 subjects, and all individuals 



not categorized as affected under a given diagnostic 
scheme were classified as unaffected. 

Cenotyping 

DNA was extracted from blood samples or lympho- 
blastoid cell lines by use of the GenePure system (Centra 
Systems). DNA from each subject was genotyped by use 
of 11 chromosome 13 markers (table 2) and 19 chro- 
mosome 8 markers (table 3) from the Weber Screening 
Set, version 6.0, spanning chromosomes 8 and 13 at an 
average spacing of 10 cM. Two additional markers from 
chromosome 8, D8S560 and D8S298, were also geno- 
typed. Cenotyping was conducted in our laboratory and 
the laboratories of the Center for Inherited Disease Re- 
search (CIDR; Johns Hopkins University, Baltimore). 
Approximately 45% of all genotypes were generated by 
both laboratories. In our laboratory, genotypes were 
generated by PCR amplification incorporating radiola- 
beled dCTR Sample handling, PCR amplification, gel 
electrophoresis, and genotype-interpretation procedures 
have all been described elsewhere (Brzustowicz et al. 
1997). Genotype generation by CIDR used automated 
fluorescent microsatellite analysis (further details are 
available at the CIDR website). The error rate for the 
CIDR genotypes, calculated from 4,384 genotypes gen- 
erated for 12 blind duplicate pairs, was 0.08% per ge- 
notype. Seventeen genotypes (0.2% of the total used in 
this study), although concordantly scored on repeat ge- 
notyping within and/or between labs, nonetheless were 
not consistent with Mendelian inheritance. These cases 
were apparently due to microsatellite instability, but 
since the source of the original allele could not always 
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Table 3 

Maximum Pairwise LOD Scores for Schizophrenia and Chromosome 8 Markers 

Model 



Recessive-Narrow Recessive-Broad Dominant-Narrow Dominant-Broad 
Marker Hetero- 



Locus 


Position' 


zygosity'' 


LOD (hLOD') 




LOD (nLOD) 


e (of) 


LOD (nLOD) 


e{a) 


LOD (nLOD) 


d{a) 


D8S264 


0 


.85 


.73 (.74) 


.1 (.90) 


.26 


.3 


.35 


.3 


.30 (.38) 


.3 (.70) 


D8S1469 


15 


.82 


.03 


.4 


0 


.5 


0 


.5 


.14 


.4 


0851130 


21 


.71 


.57 (.61) 


.2 (.80) 


.11 (.12) 


.3 (.90) 


.52 (.53) 


.2 (.85) 


.32 (.54) 


.3 (.35) 


D8S1106 


25 


.69 


.72 


.2 


0 


.5 


0{.18) 


.5 (.15) 


.67 (.88) 


.3 (.45) 


0851145 


36 


.75 


.81 (.81) 


.2 (.95) 


.39 (.53) 


.3 (.30) 


.50 


.2 


.35 (.38) 


.3 (.75) 


O8S560** 


43 


.80 


1.33 


.2 


.37 


.3 


.82 


.2 


.38 (.40) 


.2 (.35) 


08S136 


43 


.80 


3.11 


.1 


.56 


.3 


3.49 


.1 


1.29 (1.31) 


.2 (.90) 


085298** 


44 


.72 


.90 


.2 


.98 


.2 


1.43 (1.47) 


.1 (.5) 


.87 (1.21) 


.3 (.55) 


08S1477 


59 


.81 


.40 (.66) 


.3 (.30) 


0 


.5 


.26 (.52) 


.3 (.30) 


.20 (.20) 


.4 (.75) 


D8S1110 


66 


.78 


.26 (.27) 


.3 (.15) 


0 


.5 


.70 


.3 


.17 


.4 


D8S1113 


77 


.78 


.06 


.4 


.08 


.4 


0(0) 


.5 (.20) 


0 


.5 


0851136 


81 


.70 


.09 (.09) 


.3 (.95) 


0(0) 


.5 (.50) 


.21 


.3 


0 


.5 


08S2324 


93 


.79 


.08 


.3 


.25 


.3 


.02 


.4 


0(0) 


.5 (.25) 


0851119 


100 


.83 


0 


.5 


0 


.5 


.10 


.4 


.06 


.4 


C8S14.2 


109 


.66 


.44 (.46) 


.2 (.75) 


0 


.5 


0 


.5 


0 


.5 


0851132 


118 


.87 


.05 (.11) 


.3 (.40) 


0 


.5 


.30 


.3 


0 


.5 


085592 


124 


.67 


,05 (.05) 


A (.60) 


.15 (.21) 


.3 (.15) 


.15 


.3 


.08 


.4 


0851179 


134 


.78 


0 


.5 


0 


.5 


0 


.5 


0 


.5 


0851128 


138 


.75 


0 


.5 


.06 


.4 


.32 


.3 


.05 (.05) 


.4 (.95) 


085256 


147 


.76 


.08 (.10) 


.3 (.25) 


0 (.06) 


.5 (.10) 


0 


.5 


0 


.5 


085373 


163 


.73 


.07 


.4 


.33 


.3 


0 


.5 


0 


.5 



Note. — LOD = maximum homogeneity LOD score; hLOD = maximum heterogeneity LOD score. 
■ In Kosambi centimorgans. 
^ Determined from 30 unrelated family members. 
' For HOMOG analyses with a< 1. 
Not part of the Weber screening set. 



be determined, we consistently deleted these ge- 
notypes from our analyses. 

Linkage Analysis 

Linkage analyses were conducted with the FASTLINK 
version 4. OP programs (Lathrop and Lalouel 1984; La- 
throp et al. 1984, 1986; Cottingham et al. 1993; Schaffer 
etal. 1994), Parametric-linkage analyses were conducted 
because they are more powerful than nonparametric 
methods (Burner et al. 1999) and are robust methods 
for detecting linkage despite errors or simplifications in 
the analyzing model, as long as both a dominant and a 
recessive model are used (Vieland et al. 1992, 1993; 
Hodge et al. 1997; Greenberg et al. 1998; Durner et al. 
1999). To minimize multiple tests, we selected four ge- 
netic models to analyze data from the genome scan — one 
dominant and one recessive for each of the narrow and 
broad diagnostic classifications. Because of difficulties in 
determining the appropriate correction for multiple tests 
when highly correlated diagnostic classifications are used 
(Lander and Kruglyak 1995), LOD scores are reported 
with no correction applied. Under the narrow diagnostic 
classification, the dominant model was schizophrenia 
susceptibility-allele frequency (p^) of .0045, with pen- 
etrance of disease (/) of .75, .50, and .001 for disease 



homozygotes (AA), heterozygotes (Aa), and normal ho- 
mozygotes (aa)^ respectively; the recessive model was 
= .065, f{AA) = .50, f(Aa) = .0015, and f(aa) = .0015. 
Under the broad diagnostic classification, the dominant 
model was = .007, f{AA) = .90, f{Aa) = .80, and 
f{aa) = .009; and the recessive model was p^ = .10, 
f(AA) = .60, f{Aa) = .01, and f{aa) = .01. Marker-allele 
frequencies for the analysis were estimated with a set of 
30 unrelated subjects from these families and were com- 
parable to those listed in the Centre d'Etude du Poly- 
morphisme Humain (CEPH) and Cooperative Human 
Linkage Center databases. Two-point linkage analyses 
were conducted by use of MLINK at recombination frac- 
tion {6) 0, .01, .05, .1, .2, .3, and .4. Multipoint analyses 
were conducted with LINKMAP. Three-point analyses 
were conducted with adjacent pairs of screening-set 
markers and the disease locus, with intermarker dis- 
tances derived from the Marshfield Map positions listed 
in tables 2 and 3. Additional markers flanking D8S136 
were used in four-point analyses with the following map 
order and recombination fractions: D8S560-.005- 
D8S136-.0001-D8S298. Heterogeneity testing was con- 
ducted with the HOMOG program (Ott 1986, 1991). 
The large size and complexity of the pedigrees in this 
sample (Bassett and Honer 1994) made multipoint link- 
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age analysis by the GENEHUNTER program (Kruglyak 
et al. 1996) infeasible because of loss of data. 

Results 

Under all models, two-point linkage analysis produced 
positive LOD scores with markers on 13q, but with sig- 
nificantly higher LOD scores under the recessive models 
(table 2). The maximum two-point LOD score of 2.09 
under homogeneity was obtained with the recessive 
model using the narrow diagnostic classification, at 
6 = .10 with D13S793. The maximum two-point LOD 
score under heterogeneity was 2.20 at ^ = .10 with 
D13S779 and a = .5, obtained under the recessive model 
using the broad diagnostic classification. A pattern of 
suggestive positive LOD scores spanning several adja- 
cent markers on chromosome 13q (D13S793, D13S779, 
and D13S796) was observed (table 2). Inspection of the 
linkage results by family revealed that the same subset 
of pedigrees was contributing to the positive linkage 
findings across this entire region. 

Under all models, two-point linkage analysis produced 
positive LOD scores with markers on 8p but with sig- 
nificantly higher LOD scores under the models using the 
narrow diagnostic classification (table 3). The maximum 
two-point LOD score of 3.49 was obtained with the 
dominant model using the narrow diagnostic classifi- 
cation, at ^ = .1 with D8S136, and was the same under 
homogeneity and heterogeneity. Positive but substan- 
tially lower LOD scores were obtained at markers on 
either side of D8S136 (table 3). 

Parametric multipoint analyses of complex disorders 
must be used with caution, since incorrect analysis mod- 
els can lead to distortions of the estimated map position 
of the disease locus and can exclude a true linked locus 
from the region between close flanking markers (Risch 
and Guiffra 1992). However, multipoint analyses are 
useful for combatting the practical limitations caused by 
inevitable uninformative marker typings, which can ei- 
ther inflate or deflate the LOD score. With large, com- 
plex pedigrees, analyzing more than two highly poly- 
morphic marker loci at a time can be computationally 
prohibitive when large regions of the genome are 
scanned for linkage. We therefore elected to conduct 
three-point analyses between adjacent marker loci and 
the disease locus. For chromosome 13, this produced, 
across all analysis models under homogeneity, a maxi- 
mum LOD score of 3.92 at 6 = .1 on the centromeric 
side of D13S793, under the recessive model using the 
broad diagnostic classification and the marker pair 
D13S793 and D13S779. In none of the three-point anal- 
yses under homogeneity was the maximum LOD score 
located in the interval between the two marker loci, an 
anticipated effect of multipoint analysis under an in- 
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correct model. Since one possible error in the model was 
the assumption of homogeneity, three-point results were 
also evaluated with HOMOG, for evidence of hetero- 
geneity. Although the hypothesis of heterogeneity was 
not statistically supported, favored by a likelihood ratio 
of only 3:1 ix^ ~ 2.3 ), the location of the maximum LOD 
score under heterogeneity did shift to the position of one 
of the marker loci. The maximum three-point LOD score 
under the assumption of heterogeneity was 4,42, with 
a =r .65 at a 0 = 0 with D13S793, again under the re- 
cessive model using the broad diagnostic classification 
and the marker pair D13S793 and D13S779 (fig. 1). 

For chromosome 8, the maximum three-point LOD 
score for the screening-set markers, under both homo- 
geneity and heterogeneity, was 3.51 at ^ = .1 on the telo- 
meric side of D8S136, under the dominant model using 
the narrow diagnostic classification and the markers 
D8S136 and D8S1477. This result was not significantly 
different from that obtained during the two-point anal- 
ysis, and the screening-set markers flanking D8S136 
(D8S1145 and D8S1477) did not demonstrate suppor- 
tive positive LOD scores (table 3). Since these findings 
could indicate a LOD score inflated by uninformative- 
ness of marker D8S136, two closely flanking markers 
(D8S560 and D8S298) were genotyped. The maximum 
two-point LOD score under both homogeneity and het- 
erogeneity with D8S560 was 1.33 at 6 = .2, under the 
recessive narrow model (table 3). The maximum two- 
point LOD score under homogeneity with D8S298 was 
1.43 at 6 = .1, under the dominant-narrow model, in- 
creasing to 1.47 at ^ = .01 and a = .5, under heteroge- 
neity (table 3). D8S560, D8S136, and D8S298 are very 
tighdy linked, spanning only 0.5 cM and with no evi- 
dence of recombination observed in this set of families. 



W 2 




Map Distance (cM) 



Figure 1 Three-point analysis of schizophrenia, with D13S793, 
and D13S779 under the recessive-broad genetic model, under ho- 
mogeneity and heterogeneity. D13S793 is at map position 0. a = ,65 
for the heterogeneity LOD score. 
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There was, therefore, no concern that using all three 
markers in four-point analyses would falsely exclude 
linkage from the region between these markers. The 
maximum four-point LOD score under homogeneity 
across all analysis models was 2.13 at 0 = .2 from 
D8S560, under the recessive model with the narrow di- 
agnostic classification (fig. 2). Results were nearly iden- 
tical when the dominant-narrow model was used (fig. 
2). There was no evidence for heterogeneity in the four- 
point analyses. 

Discussion 

To date, individual linkage studies of schizophrenia 
have produced mixed results, often characterized by sug- 
gestive but not significant results and by lack of repli- 
cation across studies (Moldin 1997). This study pro- 
duced evidence of a significant linkage on 13q32, as well 
as of a suggestive linkage on 8p21, <5 cM from the 
strongest previously reported linkages on both these 
chromosomes (Blouin et al. 1998). Furthermore, the re- 
sults independently establish linkage of schizophrenia to 
markers on chromosome 13q32, satisfying significance 
standards appropriate for a genomewide scan of a com- 
plex disorder. 

In this study, we have tested two modes of inheritance 
and two closely related diagnostic classifications. Lander 
and Kruglyak (1995) have suggested a LOD score of 3.3 
as the significance threshold for linkage studies of com- 
plex traits, noting that the appropriate correction for 
testing multiple, closely related models is not readily 
known. Morton (1998) has argued that a LOD of 3 
provides convincing evidence of linkage even for com- 
plex traits, as long as the assumptions of the test are not 
violated. Hodge etal. (1997) have demonstrated through 
simulation studies that a 0.3-LOD-unit correction for 
testing over two dominance models is conservative. 
Risch's (1991) proposal to subtract \og^o{t) for t tests 
from the obtained LOD score is a conservative correc- 
tion for analysis over multiple diagnostic or inheritance 
models and may be overly conservative, given the highly 
correlated nature of the two affected phenotypes in the 
current study. Nonetheless, applying a log,o(^) correction 
for the four tested models to the maximum LOD score 
of 3.92 for chromosome 13 markers obtained under the 
hypothesis of homogeneity would still produce a signif- 
icant corrected LOD score of 3.3. 

The autosomal recessive mode of inheritance was fa- 
vored in our analysis of 13q, as it was in the studies by 
Straub et al. (1997) and Blouin et al. (1998). This sup- 
port for an autosomal recessive mode of inheritance was 
an unexpected result, since one of our ascertainment 
criteria was a pattern of expression suggestive of auto- 
somal dominant transmission (Bassett and Honer 1994). 
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Figure 2 Four-point analysis of schizophrenia, with D8SS60, 
D8S136, and D8S298, under homogeneity and the dominant- narrow 
and recessive-narrow genetic models. D8S560 is at map position 0. 
Because of their close proximity, the positions of the three markers 
are indicated by a single point, on the X-axis, that spans map position 
0-0.5 cM. Also because of proximity, the plotted points for D8S136 
and D8S298 overlap and appear as a single point on the LOD-score 
plot. 

However, common recessive disease alleles in a popu- 
lation can produce pedigree patterns that appear to be 
autosomal dominant, underscoring the need for con- 
ducting parametric analyses under both dominant and 
recessive models for disorders with unknown modes of 
inheritance (Vieland et al. 1992, 1993; Durner et al. 
1999). 

Although the maximum LOD score for chromosome 
13 was obtained by use of the broad definition of af- 
fection, prior reports of suggestive and significant chro- 
mosome 13 linkages have been obtained by use of a 
narrow disease definition (Pulver et al. 1996; Lin et al. 
1997; Straub et al. 1997; Blouin et al. 1998; Shaw et 
al. 1998). Although some of these other studies did not 
perform an analysis under a broad definition of disease 
(Pulver et al. 1996; Blouin et al. 1998), those that did 
use multiple diagnostic classifications reported that the 
largest maximum LOD scores occurred under the nar- 
row definition of disease (Lin et al. 1997; Straub et al. 
1997; Shaw et al. 1998). However, since these studies 
did not also report the scores obtained under the broader 
diagnostic definitions, it is impossible to gauge the mag- 
nitude of the effect of diagnostic classification on max- 
imum LOD score. The maximum homogeneity multi- 
point LOD score under the recessive narrow model was 
3.08, at the same location as the recessive-broad mul- 
tipoint maximum of 3.92. These scores are significantly 
higher than the maximum homogeneity multipoint LOD 
scores obtained under the dominant model: 1.38 and 
0.28 with the narrow and broad definitions of disease, 
respectively. 
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The maximum LOD scores on 8p were similar under 
both the dominant- and recessive-narrow models (table 
3 and fig. 2), consistent with the results of Pulver et al. 
(1995) and Kendler et al. (1996). However, there was a 
striking decrease in LOD scores under the broad models. 
Most of the prior reports of linkage on 8p considered 
only a narrow definition of affection (Moises et al. 1 995; 
Pulver et al. 1995; Schizophrenia Linkage Collaborative 
Group for Chromosomes 3, 6, and 8 1996; Blouin et 
al. 1998; Kaufmann et al. 1998). Shaw et al. (1998) 
considered several diagnostic classifications and ob- 
tained the strongest evidence for linkage under a very 
narrow definition of disease. This is opposite to the effect 
described by Kendler et al. (1996), who reported di- 
minished support for linkage under a narrow diagnostic 
classification. Better definition of the aspects of the 
schizophrenia phenotype linked to this locus may help 
to clarify these results and to define a more appropriate 
genetic model for future linkage studies. 

Recent simulation studies have suggested that, as long 
as linkage analyses are conducted under both a recessive 
and dominant model, parametric approaches can have 
significant power to detect linkage of simple or complex 
disorders, with little loss of power even when the ana- 
lyzing model is constructed with an arbitrary disease 
penetrance (Hodge et al. 1997; Greenberg et al. 1998; 
Durner et al. 1999). Given this, it is not surprising that 
despite the slightly different genetic models used by dif- 
ferent investigators for their parametric-linkage studies, 
the same small regions of chromosomes 8 and 13 have 
been implicated in schizophrenia susceptibility, by a 
number of groups. It is also clear, however, that not all 
samples have produced suggestive or significant evidence 
of linkage to these regions (Coon et al. 1994; Kunugi et 
al. 1996; Harden and Morissette 1998; Faraone et al. 
1998; Levinson et al. 1998; Wildenauer and Schwab 
1998), possibly because of factors such as genetic het- 
erogeneity between samples, differences in diagnostic ap- 
proaches, or sampling effects. The results presented here, 
as well as the results presented by Blouin et al. (1998), 
demonstrate that significant linkage results can be ob- 
tained in studies of complex disorders when moderate- 
sized samples (<500 subjects) and parametric-linkage ap- 
proaches are used. Furthermore, these significant, repl- 
icable linkage results suggest that major-gene effects may 
be present in some samples of familial schizophrenia. 
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This paper is divided into two parts. Part I provides 
an overview of the epidemiology and genetics of 
schizophrenia and a description of the genetic strat- 
egies being pursued to identify schizophrenia suscep- 
tibility genes. Part II provides a description of the 
Johns Hopkins University Collaborative Schizophrenia 
Study that includes a summary of progress toward the 
identification and understanding of factors which con- 
tribute to schizophrenia susceptibility and to its 
heterogeneity. 

PART I: OVERVIEW OF THE EPIDEMIOLOGY 
AND GENETICS OF SCHIZOPHRENIA 

Epidemiology 

Schizophrenia is a frequent and disabling disease of 
variable expression and unknown cause. Symptoms in- 
clude hallucinations, delusions, lack of motivation, and 
anhedonia. Prevalence of the disease is estimated to be 
about 1% worldwide (Eaton 1985). Ordinarily the ill- 
ness develops in adolescence or early adulthood but 
has been observed to have onset in children as well as 
in later life (Gotlesman and Shields 1982). Gender dif- 
ferences can be observed in prevalence, age at onset, 
premorbid characteristics, clinical expression, course of 
illness, response to treatment, and family morbidity 
risk (for review, see Goldstein et al. 1989). In all 
likelihood, schizophrenia is etiologically heterogeneous 
(for review, see Tsuang and Faraone 1995). 

Risk Factors Associated with Etiology 

What risk factors do we know may be associated 
with cause? Clearly, a family history of schizophrenia 
increases risk to relatives, and this is the basis for the 
search for genes involved in susceptibility (Gottesman 
and Shields 1982). For example, first-degree relatives 
(parents, siblings, and children) of schizophrenics who 
share on average 50% of their genes have approxi- 
mately a 10% risk. Among first-degree relatives, the 
risk varies from 6% for parents, 10^^10% for siblings, 
and 13% for children. If one of the parents is also 
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schizophrenic, the risk to the siblings increases to 17%. 
Children of two schizophrenics have a risk of 46% of 
developing the illness. Risk to second-degree relatives 
is approximately 3%. In addition, it has been repeated- 
ly shown that concordance rates for schizophrenia are 
higher among monozygotic twins than among dizygotic 
twins (Gottesman and Shields 1982). 

Initially, deficits in attenfion were shown to predict 
the later manifestation of schizophrenia among the off- 
spring of schizophrenic parents (Erlenmeyer-Kimling 
et al. 1993), However, recently, data from a prospective 
study of a large birth cohort revealed that those indi- 
viduals who later developed schizophrenia showed 
deficits in a range of behaviors as early as age four 
(Done et al. 1994; Jones et al. 1994). This finding im- 
plies that childhood and early adolescent behavioral 
and attentional deficits may be meaningful risk factors 
for schizophrenia in the general population (i.e., not 
just for the offspring of schizophrenic parents). 

Other factors which may or may not have a genetic 
basis have been implicated as risk factors. These in- 
clude season of birth, perinatal or obstetrical complica- 
tions, and viral infection and/or extreme dietary restric- 
tion during the first trimester of pregnancy (for review, 
see Eaton 1991). With respect to season of birth, there 
is convincing evidence that a winter-spring birth date 
increases the risk for schizophrenia (for review, see 
Boyd et al. 1986). The basis for this association is un- 
known, but several hypotheses have been offered. 

Without knowing the basis for the association be- 
tween season of birth and the risk for schizophrenia, 
one can use season of birth to examine the hypothesis 
that schizophrenia is etiologically heterogeneous by 
looking for differences between winter-bom and non- 
winter-born schizophrenics which may imply dif- 
ferences in causal mechanisms. Studies using an epi- 
demiological sample of paUents hospitalized in the 
greater BaUimore area have shown that schizophrenic 
patients who were bom during the winter-spring 
months (February through May) differ from those 
born in other parts of the year by being more likely to 
have affected relatives (Pulver ei al. 1992b). In addi- 
tion, mothers of the winter-spring-born schizophrenic 
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patienis were more likely to have a longer interval be- 
tween their pregnancies than were the mothers of 
winter-spring-born controls (Pulver et al. 1992a). 
Other workers have reported that season of birth may 
be associated with specific clinical features of 
schizophrenia. For example, Katsanis el al. (1992) pro- 
vide evidence that winter-spring-born schizophrenics 
are more prone to dysregulation of eleclrodermal ac- 
tivity than are winter-born controls. Non-winter-born 
schizophrenics did not differ from non-winler-born 
controls with respect to electrodermal activity. In addi- 
tion, using a large sample of schizophrenic patients. 
Saccheiti et al. (1992) report that patients born during 
the winter-spring months show a higher frequency of 
enlargement of cerebral ventricles when compared to 
schizophrenic patients born during the rest of the year. 
The association between season of birth and 
ventricular enlargement was not present for patienis 
diagnosed with affective disorders. 

The role of obstetrical and perinatal events is un- 
clear, but there is overwhelming consistency in the data 
obtained in retrospective studies that implicate ihem as 
risk factors for schizophrenia (for review, see McNeil 
1995). More recently, data available from a prospective 
study of a large birth cohort did not find such an asso- 
ciation (Sacker et al. 1995). It remains unclear what 
specific role these factors play and how they interact 
with other risk factors such as genetic susceptibility. 
Susser and colleagues (1996), in a study of a cohort 
bom during the Dutch famine of 1944-45, found that 
exposure to extreme deficits in nutrition during the 
first trimester of pregnancy led to an increased risk for 
schizophrenia in the offspring. 

One final association should be mentioned. As 
reviewed by Eaton et al. (1992), results from several 
studies suggest a lower than expected rate of rheu- 
matoid arthritis among schizophrenics. The basis for 
the negative association between the two disorders is 
unknown. Several theories have been suggested, in- 
cluding nutritional, hormonal, and immunologic. 

A Neurodevelopmental H>pothesis 

It has been hypothesized that unspecified events, of 
genetic and/or environmental origin, acting eariy in fe- 
tal life may lead to aberrant development of cerebral 
structure and function, predisposing to schizophrenic 
illness later in life (Murray and Lewis 1987; Wein- 
berger 1987; Lewis 1989; Lyon el al. 1989; Murray and 
O'Callaghan 1991). Although the literature on struc- 
tural brain abnormalities is not altogether consistent, 
schizophrenic patienis as a group have been reported 
to have significantly larger ventricles and/or atrophy of 
the frontal regions when compared to controls (see, 
e.g., Cannon and Marco 1994). Specific regional 
changes affecting frontal, temporal limbic, and basal 
ganglia have been reported (Gur and Pearison 1993). 
Neuropathological studies indicate decreased cell num- 
bers and/or cyioarchitectural abnormalities in certain 



brain regions that suggest disturbed brain-cell migra- 
tion. The changes do not appear to be associated with 
degenerative or inflammatory processes and are most 
likely due to some neurodevelopmental lesion (Benes 
et al. 1986; Jakob and Beckmann 1986: Weinberger 
1987; Jones and Murray 1991; Roberts 1991; Benes 
1993; Keshavan et al. 1994; Marsh et al. 1994; Murray 
1994; Ross and Pearison 1996). 

Additional evidence to support the neurodevelop- 
mental hypothesis is thai, in some cases, enlarged 
cerebral ventricles are present at or before the onset of 
the illness, whereas in others there is reduced volume 
of both temporal lobe and hippocampus (Weinberger 
1987). Recent studies reported by Petty et al. (1995) 
have implicated the heteromodal cortex. Reversed 
asymmetry of the planum temporale was noted in a 
higher proportion of the schizophrenic subjects when 
compared to matched controls. In addition, both der- 
maloglyphic abnormalities and minor physical 
anomalies have been found at a higher rate among 
schizophrenic subjects than normal controls (Green et 
al. 1994). These findings provide additional support 
that differences between schizophrenics and the gener- 
al population are present in utero, 

A consensus is thus emerging that this illness occurs 
because of disturbances in intrauterine life that lead to 
structural and functional brain abnormalities. These 
could take the form of abnormalities of cell migration, 
failure to make connections, or reduced density of 
synapses, and clinical heterogeneity might vary accord- 
ing to the localization of the abnormality. There is rea- 
son to believe that the genetics of schizophrenia (or 
some forms of schizophrenia) is tied to the genetics of 
neurodevelopment, an observation that should help to 
focus efforts to pursue the elusive genetic basis of this 
devastating disease. 

In summary, our understanding of the epidemiology 
of schizophrenia is far from complete. Additional 
studies with large numbers of subjects are needed to 
determine the interrelationships among risk factors 
and clinically descriptive variables. Most of the studies 
have emphasized one or two variables, perhaps be- 
cause of limited numbers in study samples. Samples of 
sufficient size are necessary to resolve the remaining 
questions. The samples are only likely to be assembled 
through collaboration of several investigators. We have 
not been accustomed to such large-scale collaborative 
work; if we are to explore the heterogeneity of the dis- 
ease by associations of constellations of genes with 
clusters of clinical variables and other risk factors, the 
assembly of large samples of patients and families is 
necessary. 

Genetics 

Twin, adoption, and family studies implicate genetic 
factors in the etiology of schizophrenia (for reviews, 
see Gottesman and Shields 1982; Murray et al. 1986; 
Tsuang et al. 1991). The mode of inheritance is not 
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known (Risch and Baron 1984; Kendler and Diehl 
1993). In all likelihood, inheritance of susceptibility is 
complex, involving both genes and experiences (see, 
e.g., McGue and Gottesman 1991). Schizophrenia 
shows familial co-aggregation with schizoaffective dis- 
order, as well as with atypical psychotic disorders, 
schizotypal, avoidant, and paranoid personality dis- 
orders (Kendler et al. 1984, 1993). In some families, 
there is co-aggregation of both schizophrenia and 
bipolar disorder, which raises the possibility of some 
shared vulnerability (Taylor 1992). 

Genetic Strategies to Identify Schizophrenia 
Susceptibility Genes 

Linkage studies. Linkage studies test the association 
of a specific marker with disease status in a family. 
Linkage studies of schizophrenia date to as early as 
1973 when Elston tested for linkage between 
schizophrenia and alleles of the Gc and Gm loci (El- 
ston et al. 1973). Only within the past 6 years has a 
genome-wide search become possible due to the ad- 
vances in recombinant DNA and other techniques that 
have led to the availability of an increasing number of 
highly polymorphic markers located throughout the 
genome. Although these markers have made it techni- 
cally possible to complete the molecular analysis, other 
problems remain in studying complex diseases such as 
schizophrenia. The lack of sufficiently large numbers of 
well-characterized patients and their relatives, the need 
for more complex statistical analysis of the data, the re- 
quirement for new computer technology to allow effi- 
cient application of analyses to multiple variables, and 
the continuing need for even more markers in certain 
regions still limit the success of this work. 

Two linkage paradigms have been used: the 
genome-wide scan and a candidate gene or region 
approach. (1) Genome scan: The genome-wide scan, 
which provided an early success in identifying the gene 
responsible for Huntington's disease (Gusella et al. 
1983) is labor-intensive and costly. Results from 
several genome-wide scans have been reported (Barr 
et al. 1994; Coon et al. 1994b; Pulver et al. 1994c; 
Moises et al. 1995b). Despite the effort involved, none 
of these studies had enough families to establish con- 
clusive evidence either for linkage and subsequent pur- 
suit of more extensive cloning strategies or for reject- 
ing linkage under heterogeneity. However, several 
regions of interest were identified. Cumulative evi- 
dence from different populations is now available to 
suggest that schizophrenia susceptibility genes may be 
found in regions on chromosome 22q (Coon et al. 
1994a; Lasseter et al. 1995). chromosome 8p (Pulver et 
al. 1995), and chromosome 6p (Straub et al. 1995). Ad- 
ditional results from our group's genome-wide scan are 
reported in Part II of this paper. (2) Candidate genes: 
In schizophrenia, it is difficult to determine what genes 
should be considered candidates, given that the 
pathophysiology of schizophrenia is not known. Many 



hypothesized "candidate" genes have been tested in 
linkage studies. A list of hypothesized candidate genes 
that have been used in linkage and/or association 
studies is provided in Table 1. No consistent evidence 
has developed from this linkage approach. However, 
no candidate gene can be excluded and determined not 
to have a role in susceptibility to schizophrenia. 

A candidate region approach may be more produc- 
tive for schizophrenia. For example, structural chromo- 
somal changes in affected individuals, including trans- 
locations and deletions, may lead to the identification 
of a specific region where a susceptibility gene may lie. 
This approach proved rewarding in neurofibromatosis, 
where reports of translocation in two affected individu- 
als helped to localize the responsible gene (Fountain et 
al. 1989). There are several reports of structural chro- 
mosomal changes in schizophrenic subjects. A review 
of the literature was prepared by Ann Bassett (1992). 
Balanced and unbalanced translocations, inverted seg- 
ments, deleted segments, and fragile sites were 
reported to occur in a single patient or several patients. 
They were found throughout the genome (i.e., Iq32.3 
lq43, 2p11-2ql3, 2q21, 2p21, 5pl3-[5pter], 5qll.2. 
5ql3.3, 6ql4.2, 6ql5, 7pl2-[7pter], 8p23, 9plM3, 9p22 
10pl2-ldq21,llq25 [llqter] 13q21.2 13q3M3q32 17pl2 
18p 18q23 [18qter] 19pl3,21) (Bassett 1992). Since her 
review, additional translocations in schizophrenic 
patients have been reported: lp22 (Gordon et al. 
1994). 2pl 1.29 (Maziade el a). 1993), 5pl4.1 (Malaspina 



Table 1. Hypothesized Candidate Genes for Schizophrenia 
Examined in Linkage or Association Studies 



Candidate 


Location 


p-Amyloid precursor protein (APP) 


21q21.3-q22.05 


Catechol O-methyltransferase (COMT) 


22qll 


Debrisoquine hydroxylase (CYP2D6) 


22ql3.1 


Dopamine receptor 1 (DRDl) 


5q35.1 


Dopamine receptor 2 (DRD2) 


nq22-23 


Dopamine receptor 3 (DRD3) 


3ql3.3 


Dopamine receptor 4 (DRD4) 


npl5.5 


Dopamine receptor 5 (DRD5) 


4p 15.1-16.1 


Dopamine transporter (DATJ 


5pl5.3 


GABA^ receptor B 


14pl2-13 


Gtutamate receptor (GLUR5) 


21q22.l 


Neuropeptide Y{NPY) 


7pl5.1 


Neurotrophin-3 (NT-3) 


12pl3 


Porphobilogen deaminase (PBGD) 


llq23.2-qter 


Proopiomelanocortin (POMC) 


2p25 


Serotonin receptor lA (HTRl A) 


5qn.2-ql3 


Serotonin receptor 2A (HTR2A) 


13ql4-q21 


Serotonin receptor 2C (HTR2C) 


Xq24 


Serotonin receptor ID (HTRl DB) 


Ip36,3-p34.3 


Tyrosine hydrogenase (TH) 


llpl5.5 


Vesicular monoamine transporter (VMAT2) 


10q25 


Dopamine-p-hydroxylase (DBH) 


9q34 


Interleukin 2(lL-2) 


4q26-q27 


Monoamine oxidase (MAO) 


Xplt.23 


Marfan 


15q21.1 


Human leukocyte antigens (HLA)(DQBl) 


6p2l.3 
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et ai. 1992), llq25 (Holland and Gosden 1990). an in- 
version on 4pl5.2-q21.3 (Palmour et al. 1994), a 
trisomy of chromosome 5 (Rios et al. 1995), fragile 
sites at 8q24 10q24 (Garofalo et al. 1993), pericentric 
inversion of 9pll-9ql3 (Nanko et al. 1993), and a 
deleted segment of 22ql 1.21-qll.23 (Bland 1982; Can- 
non-Spoor et al. 1982; Pulver et al. 1994b; Lindsay et 
ai. 1996). 

In addition to the candidate regions described 
above, there is also an hypothesis that the susceptibility 
to schizophrenia may be related to mutations specific 
to the sex chromosomes (for review, see Delisi et al. 
1994a). Linkage/sib pair studies examining markers in 
the pseudo-auiosomal region of the X chromosome 
have been reported (see, e.g., Collinge et al. 1991; 
Asherson el al. 1992; D'Amato et al, 1992, 1994; Wang 
et al. 1993; Barr et al. 1994; Crow el al. 1994; Delisi el 
at. 1994b; Kalsi et al. 1995; Maier et al. 1995; Okoro et 
al. 1995). The evidence from these studies is not con- 
clusive; one cannot rule out the possibility of linkage of 
some region of the X chromosome with susceptibility 
to schizophrenia in some families. 

Using a linkage strategy to identify schizophrenia 
susceptibility genes, one cannot expect to be able to 
identify a specific locus; rather, these studies may iden- 
tify a region of interest within 1 cM of the gene (see, 
e.g.. Devlin and Risch 1995). Using a combined 
linkage/association strategy (the association study 
strategy is described below) is likely to have more suc- 
cess (Ghosh 1995). The application of tests for linkage 
is not straightforward, given some of the properties of 
this disease. Difficulties include: 

1. Late age at onset: Individuals may have the suscep- 
tibility gene but not exhibit the disease because they 
have not lived through the risk period. 

2. Variability in expression: Uncertainly remains about 
the definition of a case of schizophrenia. Several 
diagnostic systems can be employed (see, e.g., 
Feighner et al. 1972; Endicott et al. 1975; American 
Psychiatric Association 1994), each with somewhat 
different criteria for schizophrenia. Pooling samples 
may cause confusion because of these diagnostic dif- 
ferences. Inler-raler reliability, even when using the 
same diagnostic system, cannot be assumed. 

3. Incomplete penetrance has been assumed because 
concordance rates among monozygotic twins has 
been reported to vary between 40% and 60%. In ad- 
dition, a study of the offspring of discordant 
monozygotic twins suggests that the rate of 
schizophrenia did not differ when the offspring of 
affected and nonaffected co-twins were compared, 
providing support for the hypothesis of incomplete 
penetrance (Kringlen and Cramer 1989). Using an 
**affecteds only" design where unaffected individu- 
als have little impact on the analysis can circumvent 
this problem. 

4. Assortative mating (i.e., the tendency for mated 
pairs to be more similar for some phenotypic trait 



than would be expected if the choice of a partner oc- 
curred at random) has been demonstrated to be 
present among schizophrenic patients (Rao et al. 
1981; Parnas 1985, 1988). Nonrandom mating may 
result in a distortion of the distribution of genotypes 
in the offspring, and within family heterogeneity. 

5. Reduced fertility and shorter life span in those who 
are affected with schizophrenia limits the avail- 
ability of potentially informative pedigrees (see 
Rimmer and Jacobsen 1976; Black and Fisher 1992). 

6. Characteristics of affected individuals, such as suspi- 
ciousness and social withdrawal, may decrease the 
probability that they and their relatives will partici- 
pate in research. 

7. The mode of inheritance is not known: Given the 
lack of knowledge about how schizophrenia is trans- 
mitted (see, e.g., Risch and Baron 1984) non- 
parametric approaches, which do not make assump- 
tions as to mode of inheritance, in addition to 
parametric linkage analyses, are being conducted 
routinely by many groups (Fang et al. 1995). 

8. Given the probable etiological and genetic hetero- 
geneity in the populations studied so far (Garver et 
al. 1989), very large numbers of families are neces- 
sary if there is to be any statistical power in the tests 
for linkage and genetic heterogeneity (Kendler and 
Diehl 1993). 

These complications, while increasing the difficulties 
of the studies, are not altogether insurmountable. 

Association studies. In the study of complex dis- 
eases of unknown pathophysiology, linkage studies are 
useful for identifying regions of interest in multiplex 
families. Association studies can be helpful for testing 
known candidate genes and for narrowing regions of 
interest, once they are identified. No assumptions are 
made about the mode of inheritance. \n this strategy, 
association of a specific allele of a polymorphic locus is 
tested with the occurrence of the disease in affected in- 
dividuals compared to ethnically matched controls. 
This strategy has a practical advantage over that of 
linkage because relatively scarce multiplex families are 
not required, and it has considerable statistical power 
to detect genes of **small effect'* (A. Chakravarti, pers. 
comm.). The Transmission Disequilibrium Test (TDT) 
is a further refinement of the association test (Spiel- 
man et al. 1993), which tests the transmission of a par- 
ticular allele from a parent to the case, using the other 
untransmitted allele from the parent as the "control." 
and this test mitigates the need for ethnically well- 
matched controls. Differences in the distribution of al- 
leles between cases and controls suggest that the 
marker may be involved with the cause or may be in 
linkage disequilibrium with the cause. Several groups 
have reported results from such tests using polymor- 
phisms of genes presumed to have some role in 
determining susceptibility to other inherited traits such 
as blood groups, HLA serotypes, and group-specific 
components (see, e.g., Fananas et al. 1990, 1992; Saha 
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Table 2. Association Studies Which Provide Support for 
Further Investigations to Determine the Possible Role These 
Genes May Play in Schizophrenia Susceptibility 





Covariation 




Candidate 


detected 


Reference 


n L./\ UKjtil UJUJ 




Nimgaonkar et al. 






(1995) 


n LAV UyjDl LaXJ-1 






n LAV ClddS 11 




z.amani et al. (1994) 




ethnic 


Nimgaonkar et al. 






(1993b) 


Dopamine D-2 


age and 


Annami et al. (1994); 


receptor gene 


family history 


itokawa et al. (1993)^ 


i^updiTiinc u-j 


sex and age 


oritron ei ai. (lyyo); 


receptor gene 


difference 


Kennedy et al. (1995) 




family history 


Nimgaonkar et al. 






(1993a); 






Crocq et al. (1992) 


Serontonin 2A 




Inayama et al (1996) 


receptor gene 






Neurotrophin-3 


sex 


Dawson etal. (1995) 


gene 


severity 


Hattori and Nanko 






(1995); 






Nanko etal (1994) 


Prophobilinogen 




Sanders et al. (1992) 


deaminase gene 






Tyrosine 




Wei et al. (1995) 


hydroxylase gene 





^Structural change in DRD2-missense nucleotide mutation from C 
to G. 

et al. 1990). At this time, no genes have been con- 
firmed as either contributory or noncontributory to the 
risk for schizophrenia. In Table 2 we summarize inves- 
tigations that have provided some preliminary evi- 
dence of an association between a candidate and the 
susceptibility to schizophrenia (or some forms of 
schizophrenia). These candidates should be studied 
further in other populations. 

Summar}': Part T 

How are we most likely to make progress in under- 
standing the etiology of schizophrenia? It is by a com- 
bined epidemiological and genetic approach in which 
we go from one discipline to the other as each new 
level of information is exposed. For example, risk fac- 
tors may aggregate in such a way as to suggest a sub- 
type of schizophrenia. Then, when genes are found, 
their presence in families may be tested in the context 
of previously described subtypes, as different risk fac- 
tors may be associated with different mutations. Sub- 
types may be further refined by additional clinical de- 
scription. Finally, as a result of this alternating 
approach, we expect to end up with well-characterized 
genes whose protein products have been identified and 
which have been shown to be associated with predict- 
able constellations of clinical characteristics. It should 
be noted that in this analysis neither epidemiology nor 
molecular genetics is sufficient; both are necessary and 
both require equal care in the collection and manage- 



ment of the data. For example, failure in precision in 
the collection of the clinical data can confound the 
molecular work, however carefully that work may be 
done, and vice versa. In the remainder of this paper we 
describe such an approach. 

PART 2: THE JOHNS HOPKINS UNIVERSITY 
COLLABORATIVE SCHIZOPHRENIA STUDY 

The Maryland Epidemiology Sample 

The Maryland Epidemiology Sample (MES) was de- 
veloped to test the hypothesis that subtypes of schizo- 
phrenia represent the outcome of different causes rath- 
er than a variable response to the same provocation. 
The research goal was to describe the etiologic mech- 
anism for the different subgroups, assuming that dif- 
ferent mechanisms would call for different treatments. 
A combined epidemiologic-genetic approach was de- 
veloped in 1983 (see Fig. 1). Such an approach had not 
previously been mounted, although a similar strategy 
has been followed by Kendler and his colleagues study- 
ing an epidemiological sample in Ireland (Kendler et 
al. 1993). The need for large-scale studies was evident, 
given that reports of evidence of schizophrenia suscep- 
tibility were often inconsistent and seldom replicated. 

By screening all psychiatric admissions to 15 facil- 
ities in the greater Baltimore area between June 1983 
and April 1989, 1670 patients were chosen to partici- 
pate in the studies. Patients with a broad range of 
psychotic diagnoses were selected for a variety of rea- 
sons, including inevitable differences in diagnostic 
practices across facilities and the relevance of diagnosis 
to genetic susceptibility. Indeed, there is evidence that 
susceptibility genes may be shared across different 
diagnostic categories (Wildenauer et al,, this volume). 

Detailed clinical and family history information was 
acquired for the patients in the MES. Prior to dis- 
charge from the hospital, an in-person diagnostic inter- 
view was administered to the patients by a clinical 
psychologist or social worker. The interview examined 
both lifetime history and current episode. Patients 
were also asked to allow us to (1) contact them periodi- 
cally after discharge from the hospital for additional in- 
formation and enrollment in further studies. (2) obtain 
copies of their psychiatric records, (3) talk with at least 
two family members to ascertain the psychiatric status 
of their relatives, and (4) receive copies of their birth 
certificate and their birth records. A concern about the 
accuracy of a psychiatric diagnosis based solely on in- 
formation reported by the patient during the diagnostic 
interview led us to develop a best-estimate diagnostic 
procedure. This diagnosis was made by a psychiatrist 
who (1) reviewed all of the information reported by 
the patient during the diagnostic interview, (2) 
reviewed all of the patient's psychiatric records, and 
(3) spoke with the patient or an informant 6 months 
after the patient's discharge from the hospital to obtain 
course and outcome information. TTie diagnosis was 
made according to DSM-III criteria. 
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MARYLAND EPIDEMIOLOGIC SAMPLE 

(N = 1,670 patients) 



- Systemic ascertainment in one geographical region 

- Available information: 

Clinical Features 
Family History 
Lifetime Diagnosis 

• Ability to recontact patients over time 

• Collection of blood samples, MRI's, Neuropsychological 
measures, Dysmorphologic Rating 

■ Variety of psychotic diagnoses represented 




■ Back to epidemiologic 
sample -- for which 
patients are these 
genes important? 



Figure 1. Schematic of epidemiologic/genetic approach. 



For consenting patients in the MES (77% of the 
patients), a family history was obtained from two infor- 
mants judged by the patient to be knowledgeable 
about his or her family. During a telephone interview. 



the informants were systematically questioned about 
the medical and psychiatric histories of the patient's 
first- and second-degree relatives. We used modified 
Family History Research Diagnostic Criteria to classify 
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the relatives (Endicott et al. 1975). Since the family his- 
tory method has been shown to be less accurate than 
directly assessing individuals for the identification of 
schizophrenia (Andreasen et al. 1977), we considered 
these data as preliminary to the collection of the data 
necessary to classify the relatives of probands for 
molecular studies. However, the family history data 
were used to determine whether patient characteristics 
delineate subtypes of schizophrenia that may differ in 
their etiologies. The results of these subtyping analyses 
are presented below. 

Results of rnitial Subtyping Analyses 

The initial subtyping factors of interest were gender, 
age at onset, season of birth, and family history of 
schizophrenia. Utilizing family history information 
about the patients in the MES with a best-estimate 
diagnosis of schizophrenia, we have demonstrated the 
following: (1) first-degree relatives of female schizo- 
phrenics have a greater risk of developing schizo- 
phrenia than do first-degree relatives of male 
schizophrenics (Wolyniec et al. 1992); (2) first-degree 
relatives of schizophrenics with an early onset of illness 
(less than age 17) are at greater risk of developing 
schizophrenia than are first-degree relatives of 
schizophrenics with a later onset of illness (Pulver et al. 
1990; Boyle et al. 1991); (3) first-degree relatives of 
schizophrenics born during the winter-spring months 
(specifically, February through May) are at higher risk 
of developing schizophrenia than are relatives of 
schizophrenics born during the other months of the 
year (Pulver et al. 1992b); (4) mothers of winter- 
spring-born schizophrenics have a greater spacing be- 
tween iheir live births than do mothers of winter- 
spring-born controls (Pulver et al. 1992a); (5) the first- 
degree relatives of schizophrenic patients who also 
have a history of manic symptoms are at an increased 
risk for developing schizophrenia (Liang and Pulver 
1996) than are the relatives of schizophrenic patients 
who do not have manic symptoms, and (6) the first- 
degree relatives of schizophrenic patients who also 
have a history of obsessive and compulsive symptoms 
are at a greater risk for depression and suicide than are 
the relatives of schizophrenics who do not have obses- 
sive and compulsive symptoms (Samuels et al., in 
prep.). 

On the basis of these results, we suggest that some 
individuals at an increased risk for schizophrenia due 
to genetic background may also be more vulnerable to 
some seasonally varying factor which further increases 
their risk (e.g., viral exposure in utero, pregnancy and 
birth complications). The greater spacing between the 
births of the mothers of winter-spring-born schizo- 
phrenic patients suggests that there might be other 
complicated pregnancies resulting in a nonviable out- 
come. It is important to determine whether the clinical 
features of schizophrenia which are associated with fa- 
milial risk (i.e., age at onset, manic symptoms, obses- 



sive-compulsive symptoms) characterize families, or 
some subset of families, or sibships. If this is the case, it 
will provide additional support for the idea that schizo- 
phrenia is etiologically heterogeneous, and these par- 
ticular covariates may be helpful in discriminating be- 
tween subgroups of patients in molecular studies. 

Genetic Linkage Study 

In 1989, a second phase of the work began. Here, 
the goal was to ascertain a set of families in which 
more than one individual was affected with schizo- 
phrenia (multiplex families) to be used to conduct a 
genome-wide search for genes and DNA markers asso- 
ciated with the risk for schizophrenia. The first source 
of such families was the MES. The families of schizo- 
phrenic patients who, on the basis of family history 
information, had at least one other living first- or 
second-degree relative with a diagnosis of either 
schizophrenia, schizoaffective disorder, or unspecified 
functional psychosis were considered eligible for this 
study. Less than one third of the schizophrenic patients 
in the MES had one or more affected relatives (for a 
description of the occurrence of schizophrenia in the 
relatives, see Pulver and Bale 1989). If direct assess- 
ment of the second affected family member confirmed 
a diagnosis of schizophrenia or schizoaffective dis- 
order, complete ascertainment of the family was 
pursued. Complete ascertainment included direct 
psychiatric examinations and blood drawings for all of 
the proband's first- and second-degree relatives, as well 
as all of the first-degree relatives of other affected indi- 
viduals in the family. Spouses of affected individuals 
were also included. Because of the need for greater 
statistical power in linkage studies, additional multi- 
plex schizophrenic families were also recruited through 
physician referrals. The nonsystematically ascertained 
families were more likely to have at least three affected 
individuals. 

Design features of the linkage study. It is our belief 
that quality control measures must be built into alt 
aspects of data collection and analysis in a study of this 
kind. Some of the key design features of the linkage 
study include: 

1. The majority of the families in the set used for the 
genome-wide scan were ascertained through a sys- 
tematically selected group of patients (the MES). 
The set of families can be characterized by as- 
certainment class— either systematic or oppor- 
tunistic. 

2. The ascertainment strategy was specified in advance 
of data collection. We attempted to acquire infor- 
mation about all first- and second-degree relatives 
of affected individuals in each family. This informa- 
tion was gathered through any or all of the follow- 
ing: reports from family informants, review of 
psychiatric treatment records, direct psychiatric ex- 
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aminaiions, and reports from treating therapists and 
psychiatrists. 

3. Rules for extending the pedigrees were specified; 
pedigrees were extended through affected individu- 
als. 

4. Direct psychiatric examinations of relatives were 
made by clinical psychologists or psychiatrists using 
standard psychiatric assessment instruments. The 
psychiatric assessments were designed to ascertain 
symptoms relevant to major mental disorders, as 
well as severe personality disorders. Information 
reported by the subject was supplemented by inter- 
viewing an informant identified by the subject. Ex- 
aminations were audiotaped with the permission of 
the subject for quality control purposes. 

5. To increase the accuracy of the assignment of af- 
fected status, final diagnoses were formulated 
through a consensus procedure. For each subject, 
two psychiatrists or psychologists independently 
reviewed all available information including the 
diagnostic interview, information reported by infor- 
mants, psychiatric treatment records, and informa- 
tion reported by the subject^s treating physician, 
and, after review, completed a DSM-HI-R diag- 
nostic assignment. If the two assignments agreed, 
the diagnosis was considered final. If there was dis- 
agreement, the case was discussed by the two raters 
until a consensus was reached. If a consensus could 
not be reached, additional information was sought 
and/or a third diagnostician was asked to review the 
case and to try to resolve the discrepancies. In some 
instances, this was not possible and the case was 
then classified as affected status unknown. Con- 
sensus was necessary not only for the diagnoses as- 
signed, but also for age-at-onset ratings for each 
diagnosis. 

6. The definition of affected and unknown phenotypes 
was specified prior to the analysis. This was done so 
that specific hypotheses could be tested rather than 
fishing for a positive result. For the analyses in the 
genome-wide search, individuals with a consensus 
diagnosis of either schizophrenia or schizoaffective 
disorder were considered affected; individuals with 
other psychotic disorders or schizotypal, schizoid, or 
paranoid personality disorder were classified as un- 
known, 

7. To control for possible bias in coding genotyping 
data, molecular genetics laboratory personnel were 
blind to the affected status of all DNA samples in 
the laboratory, and all genotyping data were coded 
after consensus of two readers. Similarly, the diag- 
nosticians were blind to the genotyping data. In ad- 
dition, the genetic analysts were not involved in 
clinical or molecular assessments. 

8. Given the variable age at onset of many psychiatric 
diseases, and the fact that many relatives have not 
lived through all the known risk periods for disease, 
a longitudinal study design was implemented to al- 
low us to maintain current affected status of all indi- 



Table 3. Description of 54 Pedigrees 



Total number of individuals 456 

individuals genotyped 363 

Affected individuals 147 
age at onset 

<35 125 

^35 17 

average 24.6 
sex 

male 90 

female 57 
DSM-III-R diagnosis 

schizophrenia 143 

schizoaffective 4 
average number of affecieds 

in each family 2.7 

Unaffected individuals 246 
age at assessment 

<35 30 

^35 226 
Individuals with diagnosis "unknown" 63 
Linearity in pedigrees (re: proband) 

maternal 19 

paternal g 

sibsonly 27 



(80%) 



(85%) 
(15%) 



(61%) 
(39%) 

(97%) 
(3%) 



(12%) 
(88%) 



(35%) 
(15%) 
(50%) 



viduals in the pedigree through regulariy updated in- 
terviews. 

Description of the families being genotyped for the 
genome-wide scan. At this time, we have completed 
the diagnostic evaluations of more than 80 families. 
For the past 18 months, we have been genotyping 363 
individuals in 54 pedigrees. Additional pedigrees are 
being added as we continue the research. However, at 
this time, all analyses are based on 54 families. Charac- 
teristics of the families are summarized in Table 3. 

Strategy for the genome-wide search. Given the 
likelihood that no one gene will have a sufficient effect 
in all of the families, simulation studies revealed that 
the number of families we had available was insuffi- 
cient to detect linkage for a gene contributing to 
liability in only a small proportion of families (under 
heterogeneity). We embarked on our studies knowing 
that collaborative efforts with other research groups 
were going to be necessary to detect genes that may be 
important to subsets of the schizophrenic families. 
Therefore, a three-stage strategy was developed; the 
first two stages involved typing markers in the families 
we ascertained, the third stage was to help facilitate the 
study of the regions that met criteria in other family 
samples (by other investigators) (see Table 4). 

In analyses of our 54 families, low positive LOD 
scores were used as thresholds to detect regions of in- 
terest from a sparse linkage map. Additional markers 
were typed to create a dense map only at promising 
genomic regions. The staged approach was used to 
minimize the number of markers required for typing. 
An initial critical LOD score for stage I was selected 
that was highly likely to include any true linkage 
(LODa 0.30). The strategy was to space markers every 
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Table 4. Three-stage Searching Strategy 

Stage I. 2-Point linkage analyses 

54 JHU families 

Find markers where LRH ^ 2.0 

(LOD a 0.30) 
Stage 2. 2-Point linkage analyses 

54 JHU families 

Find markers where LRH a 10.0 

(LOD i 2.0) 
Stage 3. "Hot spot" 

Facilitate follow-up of region by other investigators 

in independent families 



20 cM. adding additional markers (every 3 cM) to any 
selected regions of 40 cM. The critical LOD score for 
stage II was selected to be greater than or equal to 2.0. 
Any regions meeting the stage II criterion were then 
released to other investigators for independent replica- 
tion (stage III). 

Statistical analysis. Affected subjects for all statisti- 
cal analyses were restricted to those satisfying DSM- 
III-R (American Psychiatric Association 1987) criteria 
for schizophrenia or schizoaffective disorder. Subjects 
with other psychotic disorders or **schi20phrenia spec- 
trum personality disorders" were classified as 
"unknown" phenotype with unknown liability. Both 
autosomal dominant and recessive models were used. 
Given that the concordance rate for schizophrenia 
among monozygotic twins has been reported to be be- 
tween 50% and 65%, the classification of those who 
are unaffected is problematic. Therefore, the two 
parametric models are defined with ''affecteds only" 
liability classes, where unaffected subjects were treated 
the same as unknown subjects; their genotypings are 
available, but their phenotype has no effect on the 
LOD score. In addition to linkage analyses using two 
models, nonparametric sib-pair analyses to lest for 
sharing of alleles identical-by-descent were conducted. 
However, the results from linkage analyses using the 
autosomal dominant and recessive models were used 
to guide the laboratory through the two-stage strategy. 

The programs MLINK, LODSCORE, ILINK 
(Lathrop and Lalouel 1984: Lathrop et al 1984, 1986), 
and HOMOG (Ott 1991) were used, via the FAST- 
LINK versions (Cottingham et al. 1993; Schaffer et al. 
1994) as implemented in the ANALYZE software 
(Terwilliger 1994) to compute 2-point maximum LOD 
scores under heterogeneity, for each locus tested. The 
ANALYZE software also estimates the proportion of 
alleles shared identical-by-descent in all available sib 
pairs (Kuokkanen et al. 1996; Salsangi et al. 1996). In 
addition, within the past several months, a novel and 
efficient way to run nonparametric multipoint analyses 
for many markers in a dense map has become avail- 
able. Some of the data have been analyzed with these 
programs, known as MAPMAKER/SIBS (Kruglyak 
and Lander 1995) and GENEHUNTER (Kruglvak et 
al. 1996). 



Results from the Genome Scan 

We have typed and analyzed more than 650 
markers, on all 22 autosomes, in the genome scan. We 
approximate that these markers cover 70% of the 
genome. Using the staged strategy, we have identified 
four regions which we felt were worthy of follow-up. 
These regions are 22qll-ql3; 8p22- p21; 3p26-p24; and 
13ql4-q32. The results of analyses for each of these 
regions, as well as the data reported from other groups 
for these regions, are summarized below. In addition, 
we report the results of analyses of data generated to 
follow up a report from Straub et al. (1995), suggesting 
linkage on chromosome 6p24-p22. 

Chromosome 22q, The region of interest on chro- 
mosome 22 is shown in Figure 2 (Pulver et al. 1994b). 
This is the first region identified in our genome scan, 
satisfying criteria for stage II. The maximum LOD 
score was obtained for the dominant model (LOD - 
2.05) for the marker D22S445 in 22ql3. The highest 
LOD score obtained using the recessive model was 
1.39 for D22S685 mapped to 22ql2. The most sig- 
nificant sib-pair sharing was found for the markers 
D22S1268, D22S686 (p=.007) in 22ql2.ql3. The region 

Region 22q 




Figure 2- Ideogram of chromosome 22. 
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of allele sharing is relatively large. These data alone 
provide only suggestive evidence for a susceptibility 
gene somewhere in this region on chromosome 22. 
Recently completed multipoint analyses confirm that 
the region of interest is large and may, in fact, contain 
two separate regions of interest. 

Prior to publishing the initial suggestive findings for 
this region, we organized a multicenter collaborative 
effort to investigate the region further. This was the 
first such effort to be taken in this field. Previous expe- 
rience with a report ^'Localization of a Susceptibility 
Locus for Schizophrenia on Chromosome 5" (Sher- 
rington et al. 1988), followed by a series of papers 
reporting no evidence for linkage (Kennedy et al. 1988; 
St. Clair et al. 1989; Aschauer et al. 1990; McGuffin et 
al. 1990; Crowe et al, 1991; Campion et al. 1992; Mac- 
ciardi et al. 1992), was problematic for the field of 
psychiatric genetics (see, e.g., Owen 1992). The early 
pronouncement of a finding of a schizophrenia suscep- 
tibility gene followed by many failures to replicate 
caused disillusionment with the scientific goal and ad- 
versely affected the reputation of psychiatric genetics 
among the general scientific community. Prior to publi- 
cation of our results on 22q (Pulver et al. 1994b), three 
research groups known to be genotyping families with 
schizophrenia were contacted in 1992 and advised of 
the findings; i.e., Kenneth Kendler and his colleagues, 
Michael Owen and Michael Gill and their colleagues,' 
and Jacque Mallet and his colleagues. A collaboration 
was begun; interlaboratory reliability exercises were 
conducted for both the genotyping data and the diag- 
nostic assignments. 

As a result of the collaborative study, 217 additional 
families were genotyped for three markers on 22q 
(D22S268, JL2RB, and D22S307), The LOD score 
analyses for this independent sample of families did 
not support linkage (Pulver et al. 1994c). However, 
since power calculations showed that there was lovJ 
power to detect linkage if 25% or less of the families 
were linked to markers in this region, we concluded 
that additional efforts needed to be mounted to test 
the hypothesis that there was a schizophrenia suscep- 
tibility gene in this region in some families. After we 
made our findings public, Coon and colleagues notified 
us that as part of their genome search using 9 families, 
they detected a LOD score of 2.09 for a recessive 
model toward the distal end of the chromosome at 
D22S276. Subsequently, their data were published 
(Coon et al. 1994a). Since then, several additional 
groups genotyped markers in the region. Although all 
of the reports have not supported linkage, none of 
them has been able to rule out the possibility of linkage 
for markers in this region for some subset of the 
schizophrenic families. Additional support has accrued: 
(1) Gill et al. (1996) reported their best evidence at 
D22S278 in 22ql2 (p<.01); (2) Schwab et al. (1995b) 
reported a mildly positive LOD (0.61) for D22S304 
which is approximately 3.5 cm centromeric to 
D22S278\ (3) Polymeropoulos et al. (1994) reported 



greater than expected allele sharing for several 
markers in the region around D22S278 (p<.02); (4) 
Moises et al. (1995a) reported some evidence for 
linkage disequilibrium between schizophrenia and al- 
lele #243 of D22S278 (p=Om) in 113 unrelated 
schizophrenic subjects and their 226 parents; and final- 
ly (5) a collaborative effort organized by Michael Gill 
and Homero Vallada et al. (Gill et al. 1996) employing 
an affected sib-pair analysis analyzed genolypings of 
one marker (D22S278 in 22ql2) typed by 11 research 
groups. From a pooled sample of more than 600 
families, the analysis of the sub sample of 296 affected 
sib-pairs for whom parental data were available 
revealed a significant excess of alleles shared by af- 
fected individuals for the marker D22S278 (p=.001 ). 

In addition to the evidence for the region around 
D22S278, there is some evidence for a schizophrenia- 
related susceptibility locus centromeric to this region. 
The evidence is as follows: (1) Although for some time 
it had been observed that children with velo-cardio fa- 
cial syndrome (VCFS), a congenital disorder associ- 
ated with an interstitial deletion of 22ql 1 , have a range 
of psychiatric and learning problems, a study using ap- 
propriate research methods found that by examining 
the psychiatric status of VCFS subjects who were be- 
tween the ages of 16 and 25, a higher than expected 
rate of schizophrenia was found (Pulver et al. 1994a). 
(2) On the basis of the evaluation of 100 schizophrenic 
patients ascertained from the MES, we have reported a 
higher rate of the 22qn deletion among the schizo- 
phrenics than would be expected based on estimated 
population prevalence (Karayiorgou et al. 1995; Lind- 
say et al. 1995). Details of this work as well as the 
results of additional work in this region are reported by 
Karayiorgou et al. (this volume). 

Chromosome 8p22-pl2. The region of interest on 
chromosome 8 (Pulver et al, 1995) is shown in Figure 3. 
The dominant model provided a maximum LOD score 
in this region for the marker SFTP2 (2.40) in 8p21 . The 
maximum LOD score for the recessive model was 2.10 
for the marker D8S136, tightly linked to SFTP2. The 
most significant evidence for a susceptibility locus in 
this region from the analyses of allele sharing among 
sib pairs is for the marker D8S258 (^<.001). Results of 
multipoint sib-pair analyses recently completed pro- 
vide additional support for this region. A report of 
these analyses is currently being prepared for publica- 
tion. 

Given the strength and the consistency of these 
data, we requested that colleagues again join in a col- 
laborative effort to study this region (stage III of our 
strategy). Douglas Levinson agreed to coordinate a 
study of 8p (and two other regions) using family data 
from 14 groups with a total of 567 informative 
pedigrees. The results of the analyses of five markers 
on 8p (D8S26I, D8S258, D8I33, D8I36, and D8S283) 
have been completed and provide additional support 
for the presence of a schizophrenia susceptibility gene 
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Region 8p22-p1 2 Region 3p25-p24 




Figure 3. Ideogram of chromosome 8. 

in this region (D. Levinson et al.; Schizophrenia Col- 
laborative Linkage Group; both in prep.). The maxi- 
mum LOD score 2.22 (p=.0014) for 13 groups (without 
our data) was obtained with a recessive model for locus 
D8S26U the most lelomeric of the five markers tested. 
The nonparametric analyses report p<.005 for D8S133 
for allele sharing in the independent replication 
sample. Slightly positive LODs (>0.3) were observed in 
10 of 13 groups. 

Chromosome 3p26-p24, Figure 4 shows the region 
3p26-p24. a region of approximately 60 cM mapped by 
22.dinucleotide-repeat polymorphisms. The maximum 
LOD score in this region was obtained with an 
**affecteds only" dominant model for the marker 
D3SJ283 (LOD score 1.56). The maximum LOD score 
in this region using the affecteds-only recessive mode! 
was also at D3SI283 (LOD score 1.75). Reanalyses 
with our current sample, and additional markers, now 
show that only one marker, D3SJ283, shows evidence 
(p=.02) of excess sib-pair sharing. Markers in this 
region of chromosome 3 were also genotyped in the 
replication effort coordinated by Levinson. All hetero- 
geneity LODs for four markers used in the follow-up 
effort (D3SI293, D3SJ283, D3SI266, D3SI298) were 




Figure 4. Ideogram of chromosome 3. 

less than 0.08 and the region was therefore considered 
negative. However, these data do not provide evidence 
to exclude the possibility of a schizophrenia suscep- 
tibility gene in the region in some individuals. 

Chromosome 13ql4'q32. The region of interest on 
the chromosome is 13ql4-q32 and is shown in Figure 5. 
The greatest evidence for this region comes from the 
sib-pair analyses for the marker D13S770 (p=.0002). 
The maximum LOD score in this region was obtained 
with a dominant model for the marker DI3SI28 (LOD 
= 3.24). The maximum LOD score for the recessive 
model was 2.53 for the marker D13S779. Recently 
completed multipoint sib-pair sharing analyses for this 
region add support for a schizophrenia susceptibility 
locus in this region. A more dense map of this region is 
currently being genotyped. 

Other evidence to support the existence of a suscep- 
tibility gene for schizophrenia in region I3q had pre- 
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viously been reported by Lin et al. (1995) in their study 
of 13 schizophrenia pedigrees in the region 13ql4.1. 
q32. LOD score analysis of markers in this region was 
moderately positive (DI3SN4; LOD=1.61). Multipoint 
results from a dominant model allowing for hetero- 
geneity using a narrow definition of affected gave a 
maximum multipoint LOD score of 2.00 for DI3SI28 
in 13q32, 

Chromosome 6p. As part of the 3-stage strategy, we 
also follow up reports of regions of interest from other 
groups. Kendler et al. informed us of their finding of a 
maximum LOD score of 3.51 for D6S296 in their large 
Insh sample of nearly 300 pedigrees (Straub et al. 
1995). There was also suggestive but weaker evidence 
for sib-pair sharing of alleles at D6S296, We have fol- 
lowed up this suggestion of linkage by typing markers 
m both 6p24-22 and in the more centromeric HLA 
region (6p21) (Antonarakis et al. 1995) (see Fig. 6). 
The strongest sib-pair sharing in the sample was seen 
for marker D6S296 (p=.0003). The marker D6Si006 
gave the highest LOD for the dominant model (0.96) 
and also gave the highest recessive LOD (1.32). 

Additional evidence supporting the existence of a 
schizophrenia susceptibility has also been obtained in 
s^eral samples (Curling el al. 1995; Schwab et al 
1995a; Wang et al. 1995) and the large collaborative ef- 
fort organized by Levinson et al. (1996). These results 
of the analysis of the collaborative effort as well as ad- 
ditional work in this region by the Kendler group are 
reported by Straub et al. (this volume). 




06S429 D6S202 



D6S260 



Figure 6, Ideogram of chromosome 6. 
Summary: Part II 

The results of our epidemiologic studies support the 
hypothesis that schizophrenia is etiologically hetero- 
geneous. The true lest of heterogeneity will only be 
possible when the specific etiologic mechanism(s) is 
delineated. The identification of schizophrenia suscep- 
tibihty genes is an important step toward attaining this 
goal. 

Over the last several years, significant progress has 
been made toward elucidating the genes associated 
with schizophrenia susceptibility. Our group has now 
reported four regions of interest from the genome scan 
(Ke., chromosomes 22q 11-13, 8p22-pl2, 3p26-24, and 
13ql4-q32). Multicenter follow-up studies have now 
been completed for three of the regions (22q, 8p, and 
3p), and evidence to support linkage has been obtained 
for two of the regions ( 22q and 8p). The region on 3p 
although studied in a large collaborative follow-up 
study, did not provide evidence supporting linkage. 
This paper represents our first report of the region on 
chromosome 13q. We are hopeful that a collaborative 
follow-up will be organized to test this region. In addi- 
tion to these regions, the cumulative evidence now 
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available for the region of interest identified by Kend- 
ler and colleagues (Straub et al. 1995) on chromosome 
6p24.p22 provides support for the existence of a sus- 
ceptibih'ty gene in this region as well. 

In all likelihood, over the next several years we will 
learn of the existence of other regions of interest and 
other potential candidate genes. However, the identi- 
fication of regions of interest is an initial step toward 
the next goal: the identification of the gene(s). The 
problem that we and others are now struggling with is 
how to move from a region of interest or a candidate 
region to the identification of a gene. Our group has 
articulated a plan, described below, to narrow the 
regions of interest. This plan requires an additional 
huge effort from scientists representing all of the dis- 
ciplines in our multidisciplinary team. 

Association/linkage disequilibrium tests. As a first 
step to follow up linkage results in family samples, as- 
sociation studies are conducted using a new sample of 
schizophrenic patients and their parents from simplex 
families in the epidemiologic sample; these families 
have not previously been analyzed in linkage studies. 
We are also planning a new data collection effort; i.e 
to ascertain a sample of Ashkenazi Jewish schizo- 
phrenic patients and their families for both sib-pair 
studies and association studies. This new sample will be 
tested to determine if the previously identified regions 
of mterest are important in this more homogeneous 
group, as well as to identify new regions of interest In 
regions where the TDT and/or association studies con- 
firm disequilibrium, family material can be used for 
molecular cloning strategies. 

Refinement of disease definition/ heterogeneity 
analyses. All of the analyses that we have reported 
thus far have been based on a strict definition of af- 
fected; i.e., a consensus diagnosis of schizophrenia or 
schizoaffective disorder. Additional analyses are 
planned that will expand the definition of affected to 
include what has previously been described as 
schizophrenia spectrum disease (schizotypal, schizoid, 
and paranoid personality disorders and unspecified 
psychotic disorders). 

It has long been recognized that there is consider- 
able clinical heterogeneity in schizophrenia. Recently 
several independent studies employing multivariate 
techniques such as factor analysis and latent class anal- 
ysis have consistently suggested that the symptoms of 
schizophrenia can be grouped into three dimensions- 
negative symptoms (affective flattening and negative 
thought disorder), disorganization (inappropriate af- 
fect and positive thought disorder), and positive 
psychotic symptoms (e.g., hallucinations) (for review 
see Andreasen et al. 1 995). The consistency of the find- 
ings across studies, and the presumed etiologic hetero- 
geneity in schizophrenia, suggest that these dimensions 
may be good candidates to be used in the definition of 
affected m linkage studies. It- mav be that genetic 
heterogeneity is tied to the clinical heterogeneity that 



has been observed. We are currently in the process of 
(1) confirming that these clinical dimensions identified 
by others can be observed in the families and (2) 
determining how classification of affected individuals 
according to these dimensions influences linkage 
results. Multipoint analyses in the regions of interest 
will be conducted treating these clinical dimensions as 
well as other variables such as season of birth and age 
at onset, as covariates. 

Molecular analyses. In addition to what is described 
above, the molecular biologists have the following 
work ongoing or planned: (1) identify any potential 
candidate gene in narrowed regions and screen for 
mutations, (2) develop a yeast artificial chromosome 
(YAC) contig across the narrowed regions, (3) sub- 
clone YACs into cosmids, (4) identify genes through 
either exon trapping or cDNA screening, (5) screen 
genes for mutations, and (6) sequence the variations. 

The goals which follow gene identification are to un- 
derstand the gene function, to describe the mechanism 
responsible for the disease, to test the heterogeneity 
hypothesis, and to describe more eliologically homo- 
geneous groups. Of course, it is our hope that success 
in this work will lead to the development of a new un- 
derstanding of schizophrenia which will allow the de- 
velopment of new, more effective medications. 
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Abstracts 



It is now well established that genetic factors play an 
important role in the etiology of psychiatric diseases such 
as schizophrenia and bipolar disorder, although their eti- 
ology is not yet known. In order to further understand 
these diseases, from 1980 to 1992. the European Science 
Foundation helped to fund a large consortium of 22 insti- 
tutions from 14 European countries which brought to- 
gether pedigrees from over 150 families suffering from one 
of these two diseases. So that the study would be uniform, 
all participating groups used the same diagnostic instru- 
ments (SCAN/PSE 10 and the Schedule for Affective Dis- 
orders and Schizophrenia-Lifetime) which in turn were 
centralized using a computerized checklist. OPCRIT. 

Genotypings using 400 markers for the entire genome 
and for almost 1.600 samples were carried out by the 
Genethon laboratory in Evry. France. The statistical 
analyses based on these 640.000 genotypings using lod 
score and sib pair methods are currently in progress and 
will be discussed. 



PRELIMINARY RESULTS OF AN INTERNATIONAL 
GENOMIC SCAN FOR SCHIZOPHRENIA. L.E. DeLisi. 
M. Kelly. S. Shaw, L. Cardon, A.B. Smith, P. Hopkins. G. 
Shields. J. Loftus, S, Laval, TJ. Crow, and R. Sherrington 
Department of Psychiatry. SUNY. Stony Brook, NY 
11733; Sequana Therapeutics, Inc.. La Jolla, California* 
and the Department of Psychiatry, Oxford. United Kine- 
dom. ^ 

An initial cohort of 80 nuclear families, which con- 
sisted of 131 sib pairs with DSM-III-R schizophrenia or 
schizoaffective disorder, had DNA examined for linkages 
to 341 polymorphic PGR markers at Sequana Therapeu- 
tics, Inc.. using a systematic genomic screening approach. 
Markers were spaced approximately 10 cM apart, with no 
gap greater than 25 cM. Description of the internationally 
based clinical sample used and laboratory procedures 
have been previously reported [Garner et al.. Am J Med 
Genet 1996;67:595-610]. 

Two-point nonparametric preliminary analyses were 
performed using the program, Genehunter. Three sepa- 
rate sets of results were examined for each of 3 models for 
illnesses: 1) chronic schizophrenia only (N = 48 families). 
2) chronic schizophrenia or schizoaffective disorder {N = 
32 families), and 3) a broad category extending the diag- 
nosis to the above plus paranoid or schizotypal personality 
disorders and other nonaffective psychoses within the 80 
families. 

Using a significance level of 0.05 to identify loci of 
potential interest for further investigations, 15 were found 
on 10 separate chromosomes. The 3 most significant re- 
gions were 1) 4q21.1-q22 with an NPL of 2.33 with marker 
D4S1546 and a broad Dx {P< .009). 2) 22qll-qll.2 with 
an NPL of 2.16 with marker D22S446 and either a broad 
or schizophrenia/schizoaffective Dx {P< .01). and 3) 2pl4- 
pl5 with an NPL of 2.02 and a broad Dx (P < .02), 

Complete parametric analyses for all marker data are 
now in progress. All regions with statistical significance at 
the P = .01 level will be examined further with more 
densely placed markers in the larger cohort of 300 families 
and a new Costa Rican isolated cohort of large sibships. 

GENOME SCAN FOR SCHIZOPHRENIA GENES* A DE- 
TAILED PROGRESS REPORT IN AN IRISH COHORT 
R.E. Straub.>'3 C.J. MacLean,^'2-3 RA. O'Neill * D Walsh ^ 



and K.S. Kendler.i-2.3 ^Department of Psychiatry, Virginia 
Commonwealth University, Richmond, VA; ^Department 
of Human Genetics. Virginia Commonwealth University 
Richmond. VA; ^Virginia Institute for Psychiatric and Be- 
havioral Genetics. Virginia Commonwealth University 
Richmond, VA; -^The Department of Psychiatry The 
Queens University, Belfast, Northern Ireland: ^The 
Health Research Board, Dublin, Ireland. 

The Irish Study of High Density Schizophrenia Fami- 
lies aSHDSF) contains 1.408 individuals with DNA in 265 
families systematically ascertained from psychiatric facili- 
ties in Ireland and Northern Ireland. Using linkage analy- 
sis, we have found support for the presence of loci influ- 
encing the susceptibility to schizophrenia in three chro- 
mosomal regions: 6p24-21. 8p22-21. and 5q21-31. In order 
to identify regions containing additional susceptibility 
genes, we are currently doing both a genome scan and 
follow-up genotyping. For the scan, we have divided the 
families randomly into three sets, and are genotyping dif- 
ferent family sets with adjacent markers. We are using 
markers at approximately 20 cM intervals at first, and 
then adding intedigitating markers to create a final reso- 
lution of <10 cM. For follow-up. we have been more care- 
fully examining many potentially positive regions— those 
indicated by our own scan and those suggested by other 
investigators. This is done by testing more markers and by 
genotyping additional sets of families. The final marker 
map will, therefore, in many regions by much more dense 
than the overall maximum final intermarker distance of 
10 cM. Detailed results from both parametric and non- 
parametric linkage analysis will be provided. 



SEARCHING FOR SUSCEPTIBILITY GENES IN 
SCHIZOPHRENIA BY AFFECTED SIB-PAIR ANALY- 
SIS (GERMANY). D.B. Wildenauer.' M. Albus.^ SG 
Schwab.i J. Hallmayer,3 C. Hanses,* G.N. Eckstein * P 
Zill^ S. H6nig,4 B. Lerer,5 R. Ebstein.^ D. Lichtermann.* 
M. Trixler.' M. Borrmann.^ and W. Maier.* » Department 
of Psychiatry. University of Bonn. Bonn, Germany; ^Men- 
tal State Hospital, Haar. Germany; ^Department of Psy- 
chiatry. University of Perth. Perth, Australia: ^Depart- 
ment of Psychiatry. University of Munich. Munich. Ger- 
many; SHadassah Medical School, Jerusalem. Israel- 
Sarah Herzog Hospital, Jerusalem, Israel; ^Department 
of Psychiatry. University of Pecs, Pecs. Hungary. 

Our sample comprises 72 families with the index pa- 
tient suffering from schizophrenia or chronic schizoaffec- 
tive disorder (schizophrenic type). The minimum require- 
ment for including a family in the sample has been two 
affected siblings with parental information for analysis 
identity by descent. Thirteen families had been ascer- 
tained in Israel (mainly Sephardic origin). 2 in Hungary, 
and 57 in Germany. A genome scan has been initiated 
usmg fluorescence technology and an ABI 377 for detec- 
tion and genotyping. For first analysis, the average dis- 
tance between markers was 10-20 cM. Areas of interest 
were rescanned with markers in a distance of 2 cM and 
less To date, we have finished the scan for chromosomes 
1.2,3.4,5.6.11.13.15,18.19.21. and 22. Four potential sus- 
ceptibility loci have been detected in our family sample 
mapping to 5q31, 6p22,23. 18p. and 22ql2,13. 

Supported by Deutsche Forschungsgemeinschaft. 
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We completed a systematic genome-wide 
search for evidence of loci linked to schizo- 
phrenia using a collection of 70 pedigrees 
containing multiple atfected individuals ac- 
cording to three phenotype classifications: 
schizophrenia only (48 pedigrees; 70 sib- 
pairs); schizophrenia plus schizoaffective 
disorder (70 pedigrees; 101 sib-pairs); and a 
broad category consisting of schizophrenia, 
schizoaffective disorder, paranoid or 
schizotypal personality disorder, psychosis 
not otherwise specified (NOS), delusional 
disorder, and brief reactive psychosis (70 
pedigrees; 111 sib-pairs). All 70 families con- 
tained at least one individual atfected with 
chronic schizophrenia according to DSM- 
III-R criteria. Three hundred and thirty- 
eight markers spanning the genome were 
typed in all pedigrees for an average resolu- 
tion of 10.5 cM (range, 0-31 cM) and an av- 
erage heterozygosity of 74.3% per marker. 
The data were analyzed using multipoint 
nonparametric allele-sharing and tradi- 
tional two-point lod score analyses using 
dominant and recessive, affecteds-only 
models. Twelve chromosomes (1, 2, 4, 5, 8, 10, 
11, 12, 13, 14, 16, and 22) had at least one 
region with a nominal P value <0.05, and two 
of these chromosomes had a nominal P 
value <0.01 (chromosomes 13 and 16), using 
allele-sharing tests in GENEHUNTER. Five 
chromosomes (1, 2, 4, 11, and 13) had at least 
one marker with a lod score >2.0, allowing 
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for heterogeneity. These regions will he 
saturated with additional markers and in- 
vestigated in a new, larger set of families to 
test for replication. Am. J. Med. Genet. (Neu- 
ropsychiatr. Genet.) 81:364-376, 1998. 
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INTRODUCTION 

Schizophrenia is a severely debilitating neuropsychi- 
atric disorder characterized by distiirbance of thought, 
auditory hallucinations, and multiple delusions. Clear 
genetic susceptibility has been shown by twin concor- 
dance, familial risk, and studies of adopted-away off- 
spring of patients with schizophrenia [Kendler, 1988; 
McGuffin et al., 1984; Kety et al., 1994]. However, no 
clear mode of inheritance has been established, despite 
several attempts by segregation analysis using large 
family data sets. Schizophrenia is therefore assumed to 
belong to the class of "complex" genetic disorders (such 
as diabetes), that have a genetic predisposition due to 
more than one gene that may produce iUness indepen- 
dently in different families, or acting together to cause 
illness in all susceptible individuals. Further complica- 
tion in the search for genes for this disorder arises from 
the uncertainty of the diagnostic boundaries deter- 
mined by the underlying genetic susceptibility. On the 
basis of family studies [Kendler et al., 1993, 1995], it 
has been suggested that severe chronic schizophrenia 
is also genetically related to schizoaffective disorder, 
other psychoses, and a milder form of symptoms ex- 
pressed by paranoid and schizotypal personality disor- 
ders, but this can only be proven once genetic loci are 
found. 

For the past decade, families with multiple ill mem- 
bers with schizophrenia have been ascertained in dif- 
ferent centers around the world, and molecular genetic 
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linkage studies have been performed. Initially, most 
groups concentrated on examining a few large families 
with multiply affected individuals; however, these 
studies yielded either a few positives that were never 
independently replicated or completely negative re- 
sults Ie.g., Sherrington et aL, 1988; Crowe et al., 1991] 
More recently it has been recognized that in the case 
of complex genetic disorders, nonparametric sib-pair 
f Qoi^^® "^^^^^ ^^^^ ^^^^ significant findings [Risch, 
1990J. For these analyses, mode of inheritance does not 
have to be specified, and only clearly ill individuals 
with no diagnostic uncertainty are used in the analysis 
It has also been recognized that, with the exception 
of the sex chromosomes [DeLisi and Crow, 1989], there 
IS no a priori hypothesis that would lead one to focus on 
any given chromosomal region. Thus, the whole ge- 
nome needs to be systematically screened. A few ge- 
nome screens of schizophrenia have been pubHshed 
and/or have been presented at meetings thus far and 
are hsted in Table I. Of these, the findings are clearly 
inconsistent across studies, although some regions ap- 
pear to have weak positive replications by at least one 
other independent group. Large collaborative efforts 
have also been formed in the hope of clarifying some of 
these weak positives, but the results remain equivocal 
IGill et al., 1996; Schizophrenia Collaborative Linkage 
Group for Chromosomes 3, 6, and 8, 1996]. 

Our own studies until recently have focused on the X 
chromosome and continue to show weakly positive lod 
scores in the pericentromeric region [DeLisi et al 
1994; Dann et al., 1997]. These remain unrephcated 
Straub et al. [1995] focused on an initial finding on 
chromosome 6p. The original finding, while not consis- 
tently replicated, has produced a plethora of indepen- 
dent attempts at confirmation, yielding a broad region 
including a number of weak positives, but also some 
negative findings, covering at least 40 cM on chromo- 
some 6p [Wang et al., 1995; Antonarakis et al., 1995* 
Gurling et al., 1995; Schwab et aL, 1995; Schizophrenia 
Linkage Collaborative Group for Chromosomes 3 6 
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^"^^^^ 1996; Mowiy et al., 1995]. Our own sample of 
211 families [Garner et al., 1996] showed no evidence of 
hnkage to markers in this region. 

Ajiother region of interest is on chromosome 22q 
first suggested by Pulver et al. [1994a]. A region some- 
what distal to this has been positive in some studies 
Moises et al., 1995b; Coon et al., 1994a; Gill et al., 
i on7u^*^,''°^ f^^^^ 1995; Pulver et al, 

iQo^ '^.u^'^^''°-P°''^''^ 1994; Parsian et al., 

lyyyj. other positive findings have been reported on 
chromosomes 2, 3, 4p and q, 5p and q, 8p, 10, 13q, and 
loL^^T^^'^ Sherrington et al., 1988; Silverman et al., 

f 1 ' ^Ifo J^^x?.?^ ' ^^^^^ 1995; Antonarakis 

et al., 1996; Wildenauer, 1996] (see Table I). At pre- 
sent, these are all preHminary findings, and do not 
reach conventional levels of accepted significance 
Some clearly are false positives, while false negatives 
may have also arisen in studies with not enough 
sample power to detect linkage. 

The following study is a systematic genome-wide 
screen to determine where and how many suggestive 
regions of linkage can be found using an initial set of 
appro^mately 100 sib-pairs. These results are com- 
pared for consistency with others already reported. 

MATERIALS AND METHODS 
Clinical 

Three international sites have been used as bases for 
the recruitment of families with at least 2 available 
sibs diagnosed with schizophrenia or schizoaffective 
disorder. Details of all cHnical procedures for this co- 
Va^^i ^^^^ previously published [Garner et al., 
1996] Identification of famihes, and clinical evaluative 
and diagnostic procedures, were similar in all loca- 
tions. Recruitment involved several methods: catch- 
ment area screening, systematic contact with health 
professionals at hospital and out-patient facilities 
withm one day's driving time from the center, and ad- 
vertisement through local and national support orga- 



TABLE L Published and Presented Genome Scans of Schizophrenia as of October 1997* ' 



References 

Pulver et al [1995] 
Antonarakis et al. [1996] 
Moises et al. [1995a] 



No. of 


No. affected 


No. of 


families 


(SZ or SA) 


markers 


57 


161 


520 


1:5 


37 


413 


II: 65 


213 




7 


31 


180 


9 


35 


329 


43 


107 


335 


30** 


79 


459 


43^ 


96 




100 


200 


220 


265 


577 


Unlisted 


59 


155 


Unlisted 


17 


79 


391 


1 


17 


352 


70 


171 


338 



Spacing 



20 cM (70%) 

9 cM (84%) 

10-20 cM 
Unlisted 

10 cM 
8-10 cM 



Chromosomal regions with 
lod scores >2 or P < 0.01 



3p26-p24, 8p22-p21, 13q32, 22qll-ql2 
2p25-21, 4ql2-13, 6p21, 9p23-p21 20pl2 



20 cM 
<10cM 
10-20 cM 
10 cM 
10 cM 
10.5 cM 



None 

4pll^ll, 22qll-qter 
2q, lOq 
8p, 2q, lOp 



lOq 



Barr et al. [1994] 
Coon et al. [1994b] 
Levinson et al. [1996] 
NIMH Genetics 
Initiative [1997] 

Williams et al. [1997] 
Straub et al. H997a] 
Wildenauer et al. [1997] 
Byerley et al. [1997] 
Horvatta et al. [1997] 
Present study and 
DeLisi et al. [1 997] 

-tin^^l^T^"""^'- •^"-""^ - present on ch™o.oso»e, l,, 2p, 2,. 3p, 4p. 4,. 6p. 5.,. 6p. 8p, 9p. lOp, 10<,. lip. 13,, 16,, 20p. and 22, 

''European-American. 



5q21-Ki31, 6p24-p21, 8p22-p21, lOp 
5q31, 6p22-p23, 18p 
2pl3-pl4 
1, 4 

Iq22-q23, 2pl4-pl3, 4p, 5pl4, 10q22-q24 
llp> 13ql2^13, 16q22, 22qll 
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nizations for famihes of the mentally ill (i.e., the Na- 
tional Alhance for Mental Illness (NAMI) in the USA, 
and the Schizophrenia A National Emergency (SANE) 
in the UK). For the present genome scan and labora- 
tory analysis, a total of 70 families was examined: 42 
famihes from the USA, 14 from the UK and Ireland, 12 
from Northern Italy, and 2 from Belgium. Only fami- 
lies of European descent and Caucasian race were in- 
cluded, and of these, only those famihes in which at 
least one member of an ill sib-pair had DSM-III-R 
schizophrenia, and a minimum of one other was diag- 
nosed with schizophrenia or schizoaffective disorder. 
These 70 families were the first of a group of 100 that 
had all final diagnostic information completed and 
enough DNA available to the laboratory for a complete 
genome scan. Of the 100, 30 were ehminated due to 
genotyping problems or failure to contain one sib with 
schizophrenia only. A larger group of over 300 famihes 
was part of the cohort collected from the above sites. 
However, these were not yet ready for analyses when 
this scan began approximately 2 years ago. DNA 
screening of the total group is now in progress. All sub- 
jects signed written informed consent for participation 
in these studies. 

Diagnoses were made using DSM-III-R criteria, 
based on a combination of a structured modified SADS 
interview (Schedule for Affective Disorder and Schizo- 
phrenia [Spitzer and Endicott, 1978]) combined with 
the SIDP (Structured Interview for Personahty Disor- 
ders [Pfohl et al., 1990]) or the DIGS (Diagnostic Inter- 
view for Genetic Studies [Nurnberger et aL, 1994]), a 
structured questionnaire asked of reliable family infor- 
mants about other family members, and medical rec- 
ords as indicated. The physicians and other profession- 
als performing clinical evaluations (approximately 2 
individuals per site) had been trained in these proce- 
dures by one of us (L.E.D.), and all had undergone pe- 
riodic diagnostic reliability exercises to maintain con- 
sistency between centers. Final diagnoses were made 
by consensus in the UK by J.L., L.E.D., and T.J.C., in 
Belgium by M.D.H. and L.E.D., in Italy by M.C. and 
A.V,, and in the US by L.E.D., G.S., and M,K. All of the 
above individuals underwent periodic diagnostic reli- 
ability exercises, with kappa statistic scores ranging 
between 0.85-0.90 for primary diagnoses [Gamer et 
al., 1996]. 

Pedigrees for each family were diagrammed and all 
families received research code numbers and separate 
numbers for each individual. Computer diagnostic files 
were maintained without knowledge of laboratory 
marker data. Similarly, laboratory marker data were 
entered into the file without knowledge of diagnostic 
information. The families consisted of affected sib-pairs 
with either schizophrenia or schizoaffective disorder. 
Thirty-eight families had both parents genotyped, 27 
had one parent genotyped. and 5 families had no par- 
ents genotj^ed. 

Three phenotj^e classifications were used in the 
analysis. The most stringent phenotype contained 
pedigrees in which affected individuals with schizo- 
phrenia only were used. This category contained 48 
pedigrees, 115 affected individuals, and 70 ill sib-pairs. 
The second category contained pedigrees in which af- 



fected individuals with schizophrenia or schizoaffective 
disorder were used. This group resulted in 21 addi- 
tional pedigrees, for a total of 70 pedigrees containing 
171 affected individuals and 101 sib-pairs. The final 
category contained affected individuals with a broad 
spectrum of diagnoses including schizophrenia, schi- 
zoaffective disorder, paranoid or schizotypal personal- 
ity disorder, psychosis not otherwise specified (NOS), 
delusional disorder, and brief reactive psychosis. This 
group expanded the number of affecteds to 182 indi- 
viduals (111 ill sib-pairs). Members of some pedigrees 
also had individuals with affective disorder diagnoses. 
In order to insure that a more homogeneous phenotype 
was being tested, pedigrees in both the schizophrenia 
plus schizoaffective and broad categories were only in- 
cluded if the pedigree had at least one individual af- 
fected with schizophrenia. The number of pedigrees 
broken down by affected sibship size and phenotype is 
shown in Table 11. In addition, the total nimiber of 
affected sib-pairs for each phenotype is given. For pedi- 
grees with greater than 2 affected sibs, all possible 
combinations of sibs were counted (n(n - l)/2; n = sib- 
ship size), and therefore all affected sib-pairs counted 
are not independent. Twelve of the 70 families were 
extended beyond the nuclear family to ascertain ill 
relatives in other branches. Two families in the broad 
diagnosis category are extended famihes and each has 
2 sibships with ill individuals, accoimting for a total of 
72 sibships in 70 pedigrees. 

Genotyping and Map Construction 

Multiplex fluorescent-based genotyping was carried 
out, using methods previously described [Hall et al., 
1996]. Three hundred and thirty-eight highly polymor- 
phic microsatellite markers were chosen primarily 
from markers developed by Genethon [Dib et al., 1996], 
the Cooperative Human Linkage Center (CHLC) 
[1994], or the Utah Marker Development Group [1995]. 
All markers are described in the Genome Database 
(GDB). Markers were chosen to be genotyped as part of 
the genome scan based on location, heterozygosity, 
PGR quality, and the ability to be multiplexed. Geno- 
type errors as a result of non-Mendelian segregation in 
pedigrees were detected as described in Hall et al. 
[1996]. Individual genotypes with obvious errors were 
discarded. When more ambiguous errors occurred, i.e., 
it was not clear which family member contained the 
genotyping error, the whole pedigree was discarded for 
each particular marker. Two pedigrees consistently 
contained errors and were therefore discarded for all 

TABLE II. Number of Pedigrees Containing Various Affected 
Sibship Sizes and Total Sample Size Per Phenotype 



Number of affected 
sibs per pedigree Total numbers 



Phenotype 


2 


3 


4 


5 


Pedigrees 


ASP° 


Schizophrenia 


42 


4 


1 


1 


48 


70 


Schizophrenia + 










schizoaffective 


61 


6 


2 


1 


70 


101 


Broad 


59 


10 


2 


1 


72 


111 


"Affected sib-pairs 


(all combinations, nonindependent). 





markers (these are not included in the total number of 
70). Overall, 988 errors were detected out of a total of 
160,304 genotypes generated in this genome scan, re- 
sulting in an error rate of approximately 0.6%. 

Genetic maps were constructed for each chromosome 
using genotype data from the above 70 pedigrees. Fam- 
ily genotype data were used instead of the available 
CEPH genotype data to construct genetic maps be- 
cause of the increased number of informative meioses 
in our data set. On average, there were 350 informative 
meioses for each marker, although most meioses were 
phase-unknown since few grandparents were available 
for genotyping in this family collection (average num- 
ber of phase-known meioses per marker = 12). Genetic 
maps for each chromosome were constructed using the 
automated mapping program MULTIMAP [Matise et 
al., 19941. The resulting distances between markers 
are given in centimorgans using the Kosambi map 
function, and are similar to those determined using 
CEPH reference pedigrees. 

Statistical Analysis 

Power analyses were calculated to determine the 
Hkelihood of detecting linkage under various gene ef- 
fect sizes, using sample sizes corresponding to the 
three phenotype categories in this study. The methods 
used to determine the estimated power were the same 
as those described in Risch [1990]. Since these power 
calculations assume fully informative matings and the 
average heterozygosity of our markers was approxi- 
mately 75%, these values represent maximum power. 
In addition, since markers in this study were spaced in 
approximate 10-cM intervals, the power to detect link- 
age was calculated at a recombination fraction of 0.05 
between marker and disease loci (on average, the maxi- 
mum distance between a potential disease locus to a 
marker locus). These power calculations are shown in 
Table III. Using a maximiun lod score (MLS) threshold 
of 2, a reasonable threshold for a first-pass genome 
scan, our sample sizes exceeded the standard accept- 
able level of power (0.8) for genes with a larger effect 
(lambda-s = 4) for both the schizophrenia plus schi- 
zoaffective disorder and broad phenotype categories. 
The schizophrenia-only phenotype still has limited 
power at this gene-effect size (power = 0.62). 

All of the analyses were conducted using each of the 
three diagnostic categories described (schizophrenia 
only, schizophrenia plus schizoaffective disorder, and 
broad diagnosis). Allele frequencies for each marker 
were calculated from the family genotype data. In re- 
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gions demonstrating some evidence for Hnkage (Table 
IV), allele frequencies were varied and results from the 
analyses were compared. No significant differences in 
the results were observed. 

For each pedigree, multipoint non-parametric lod 
(NFL) scores and corresponding P values were com- 
puted at each marker and estimated at five equally 
spaced positions between each marker on the chromo- 
some, using the GENEHUNTER program [Kruglyak et 
al., 1996]. Map order and distances were used as esti- 
mated from the genetic maps constructed for each chro- 
mosome. In addition, NPL scores and corresponding P 
values were estimated 5 cM beyond the first and last 
markers, respectively, of the multipoint analyses at 1- 
cM increments. The multipoint NPL scores and corre- 
sponding P values were summed over all pedigrees. 

Two-point affecteds-only parametric analyses were 
conducted for all markers on 12 chromosomes (12 4 
5, 8, 10, 11, 12, 13, 14, 16, and 22), using the program 
MLINK of FASTLINK [Lathrop et al., 1984; Cotting- 
ham et al., 1993]. These chromosomes had at least one 
marker with a corresponding P value <0.05, using mul- 
tipoint nonparametric analysis. Both dominant and re- 
cessive models were tested for each marker. The pa- 
rameters for these models were identical to those de- 
scribed by Straub et al. [1995]. Both dominant and 
recessive models had a reduced penetrance of 0.55, a 
phenocopy rate of 0.0006, and disease allele frequen- 
cies of 0.0049 and 0.0991 for dominant and recessive 
models, respectively. Lod scores were computed for 
each pedigree and summed over all pedigrees. The re- 
combination fraction (theta) was varied between 0-0.5 
in increments of 0.02 until the lod score was maxi- 
mized. In addition, summed lod scores for each marker 
were tested for evidence of heterogeneity, using the 
program HOMOG [Ott, 1991]. Lod scores computed un- 
der heterogeneity (HLOD) were estimated by varying 
values of alpha (proportion of linked families) and 
theta until the HLOD scores were maximized. For 
parametric analyses, MLINK was chosen over GENE- 
HUNTER so that Z^^ and corresponding theta values 
could be estimated rather than calculating a lod score 
at theta = 0. 

RESULTS 

In order to assess genome coverage and determine 
accurate genetic distances between markers used in 
the genome scan, genetic maps were constructed from 
the genotype data generated from the set of 70 pedi- 
grees. The estimated genome length from this data set 



TABLE III. Power of Various Sample Sizes Corresponding to Sz 
(Schizophrenia), Sz + SA (Schizophrenia + Schizoaffective), and Broad 
Phenotype Categories, Respectively, Assuming a Recombination Fraction of 
0.05 Between Marker and Disease Loci, and Full Information Co ntent* 

>^-s = 2, X-s = 3, X-s = 4, 

MLS no. of ASP no. of ASP no. of ASP 

threshold 70 101 111 70 101 111 "to 101 TTT 

2 0.16 0.25 0.30 0.44 0.62 0.71 0.62 0.81 0 87 

5 0.04 0.10 0.14 0.17 0.38 0,48 0.30 0.59 o!71 



*ASP, affected sib-pairs. 
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TABLE IV. Markers With Parametric led Scores Above 1.5, or Peak Nonparametric Multipoint 

Values < 0.05 and Their Corresponding Phenotype 

Distance from 
pter (cM) 



NPL Scores With Corresponding P 



Marker 

Chromosome 1 

D1S551 

D1S196 

D1S1699 

Chromosome 2 

D2S1337 

Chromosome 4 
D4S1546 

D4S2456 

D4S400 

Chromosome 5 

D5S406 

D5S433 

Chromosome 8 

D8S560 

GAAT1A4 

Chromosome 10 

D10S677 

Chromosome 11 

D11S1393 

D11S2002 

Chromosome 12 

D12S85 

D12S342 

Chromosome 13 

D13S1293 

D13S168 

D13S170 

D13S1315 

Chromosome 14 

D14S290 

D14S1012 

Chromosome 16 

D16S421 

Chromosome 22 

D22S446 

D22S283 



Chromosome 
location 



Phenotype 2^^ Theta HLOD Theta Alpha NPL(/'<) 



"Dominant. 
^'Recessive. 



Ip31-pl3 
Iq22-q23 
Iq31-q32 


118.4 

IRA 1 

207.0 


Broad 
Sz + SA 
Broad 


1.41 
1.81 
0.46 


0.20^ 
0.18'' 
0,28'' 


1.41 
2.40 
0.57 


0.20 
0,00 
0.04 


1.00 
0.46 
0.22 


2pl4-pl3 


60.2 


Sz + SA 
Broad 


1.56 
1.60 


0.16« 
0.18° 


2.19 
1.98 


0.00 
0.00 


0.41 
0.36 


4pter-cen 

4al3-a21 
4qll-q21 


18.9 

OO.O 

68.2 


Sz 

Sz + SA 
Sz 

Sz + SA 


1.91 
1.22 
1.89 
1.48 


0.12° 
0.20° 
0.12^ 
0.16° 


2.44 
2.05 
1.89 
1.94 


0.00 
0.00 
0.12 
0.00 


0.56 
0.40 
1.00 
0.50 


5pl5 

5pl4-q21 


0.0 
109.8 


Sz + SA 
Sz + SA 


0.85 
0.91 


0.26" 
0.26'' 


0.85 
1.00 


0.26 
0.08 


1.00 
0.33 


8p22-p21 
8q22 


36.0 
103.7 


Sz 

Broad 


1.99 
0.37 


o.ie'' 

0.30'' 


1.99 
0.37 


0.16 
0.30 


1.00 
1.00 


10q22-Hq24 


113.8 


Sz + SA 


0.74 


0.26'' 


0.74 


0.20 


0.60 


llpter-pll 
Ilq21-q23 


59.0 
78.2 


Sz 
Sz 


2.30 
0.88 


0.10" 
0.26'' 


2.40 
1.59 


0.00 
0.00 


0.59 
0.28 


12pl3^24 
12q22-q24 


45.9 
131.4 


Sz + SA 
Broad 


0.34 
0.82 


0.26° 
0.24° 


0.34 
0.82 


0.20 
0.00 


0.64 
0,28 


13ql2-ql3 
13ql4 
13q31 
13q34 


16.5 
35.6 
65.8 
108.4 


Sz 
Sz 

Sz + SA 
Broad 


2.85 
1.56 
0.63 
0.58 


0.10° 
0.16° 
0.28° 
0.28° 


2.85 
1.72 
1.25 
0.64 


0.10 
0.00 
0.00 
0.00 


1.00 
0.49 
0,25 
0.23 


14q21-q23 
14q21-q23 


51.0 
64.4 


Sz + SA 
Sz + SA 


0.02 
1.35 


O.SS** 
0.24'' 


0.45 
1.99 


0.00 
0.00 


0.17 
0.29 


16q22.1 


62.7 


Sz 


0.12 


0.32'' 


0.21 


0.00 


0.17 


22qll 
22ql2-ql3 


13.8 
42.0 


Sz + SA 
Sz 


1.50 
1.77 


0.22'' 
0.20" 


1.50 
1.77 


0.22 
0.20 


1.00 
1.00 



1.65 (.05) 
1.04 (.14) 
1.86 (0.03) 

1.97 (.02) 
2.13 (.01) 

1.35 (.05) 
1.96 (.02) 

1.59 (.03) 
1,75 (.03) 

2.18 (.01) 

1.60 (.05) 

1.20 (.07) 

1.58 (.05) 

2.08 (.01) 

0.89 (.14) 

1.59 (.03) 

2.01 (.02) 
1.57 (.05) 

1.80 (.01) 
1.51 (.03) 
1.83 (.03) 
1.69 (.04) 

1.99 (.02) 
1.71 (.04) 

2.09 (.01) 

2.16 (.01) 
1.64 (.02) 



was 3,310.3 cM, providing an average resolution of 10 5 
cM between each of the 338 markers, with an interval 
range of 0-31 cM. Most intervals between markers 
were between 5-20 cM (228 out of 315 intervals; 72%). 
Seven intervals had no recombination between mark- 
ers. Markers chosen for the genome scan were highly 
polymorphic microsatellite markers, with an average 
observed heterozygosity of 74.3%. 

Each of the 338 markers was analyzed using multi- 
point nonparametric allele-sharing tests in GENE- 
HUNTER for each of the three phenotype classifica- 
tions described, using map order and distances esti- 
mated from the family genotype data. Figure 1 shows 
the multipoint curves for each chromosome. Chromo- 
somes 1, 2, 4, 5, 8, 10, 11, 12, 13, 14, 16, and 22 have at 
least one peak NPL score with a corresponding P value 
<0.05 (see Table IV). These P values are uncorrected 
lor genome-wide tests. The most significant peak from 
the nonparametric multipoint analyses occurred at 
D16S421 located in 16q22.1, using a schizophrenia- 
only phenotype (NPL = 2.09, P < 0.01) 



content was also calculated using 
GENEHUNTER at the same positions as the multi 
point analyses, i.e., for all markers along the chromo- 
some and in five equally spaced positions between each 
marker. The average information content for all auto- 
somal markers was 0.67 (range, 0.52-0.78). The infor- 
mation content curves for each chromosome are shown 
m Figure 2. 

In order to obtain more evidence for the suggestive 
regions found using nonparametric Hnkage analysis 
parametric tests were conducted on the 12 chromo- 
somes demonstrating evidence for linkage. Both domi- 
nant and recessive affecteds-only models were used un- 
der each of the three diagnostic phenotypes described 
using MLINK of FASTLINK. In addition to calculating 
the lod score, an HLOD and corresponding alpha value 
was also calculated using HOMOG in order to deter- 
mine the lod score under heterogeneity and the propor- 
tion of famihes linked to a particular marker, respec- 
tively. Nine of the 12 chromosomes analyzed had at 



TABLE V, All Loci Examined and Corresponding Highest 
Multipoint NPL Score Obtained 



Distance from 



Marker 


pter (cM) 


Phenotype 


NPL 


P< 


Chromosome 1 










D1S243 


0.0 


SZ 


-0.08 


0.54 


D1S548 


9.2 


Broad 


0.62 


0.25 


D1S228 


23.6 


Broad 


0.64 


0.25 


D1S552 


38.7 


Sz + SA 


0.51 


0.29 


D1S164 


56.7 


Sz + SA 


1.51 


0.06 


D1S3175 


77.7 


Broad 


1.44 


0.07 


D1S232 


82.7 


Broad 


0.80 


0.20 


D1S405 


90.2 


Broad 


0.45 


0.32 


D1S209 


103.0 


Broad 


-0.06 


0.52 


D1S192 


107.0 


Broad 


0.49 


0.30 


D1S551 


118.4 


Broad 


1.65 


0.04 


D1S167 


128.1 


Broad 


1.34 


0.08 


D1S223 


139.7 


Broad 


0.77 


0.21 


D1S453 


159.1 


Sz 


0.12 


0.44 


D1S484 


171.9 


Sz + SA 


0.49 


0.30 


D1S194 


178.1 


Sz + SA 


0.55 


0.28 


D1S196 


184.1 


Sz + SA 


1.04 


0.14 


D1S466 


202.6 


Broad 


1.06 


0.13 


D1S240 


205.4 


Broad 


1.79 


0.03 


D1S1599 


207.0 


Broad 


1.86 


0-03 


D1S245 


229.3 


Sz 


1.36 


0.05 


D1S1602 


234.5 


Sz 


1.35 


0.05 


D1S235 


256.7 


Sz 


0.72 


0.18 


D1S517 


266.3 


Broad 


0.44 


0.32 


D1S2842 


276.9 


Sz 


0.41 


0.30 


Chromosome 2 










D2S1329 


0.0 


Sz 


0.63 


0.22 


D2S165 


25.2 


Sz 


1.13 


0.08 


D2S367 


35.5 


Broad 


0.88 


0.18 


D2S391 


50.0 


Broad 


1.43 


0.07 


D2S1337 


60.2 


Broad 


2.13 


0.01 


D2S438 


76.4 


Broad 


1.55 


0.05 


D2S139 


82.5 


Sz + SA 


1.57 


0.05 


D2S135 


91.6 


Sz + SA 


1.44 


0.06 


D2S2224 


112.1 


Sz + SA 


1.40 


0.07 


D2S347 


113.1 


Sz 


1.07 


0.09 


D2S114 


124.9 


Broad 


1.11 


0.12 


D2S368 


126.1 


Broad 


1.09 


0.13 


D2S150 


130.8 


Broad 


0.55 


0.28 


D2S141 


144.3 


Sz 


0.14 


0.43 


D2S93 


145.4 


Sz 


0.51 


0.26 


D2S418 


145.5 


Sz 


0.52 


0.26 


D2S326 


157.6 


Sz 


0.27 


0.37 


D2S389 


169.2 


Sz 


-0.39 


0.69 


D2S115 


175.4 


Sz + SA 


0.06 


0.47 


D2S116 


176.8 


Sz + SA 


-0.06 


0.52 


D2S422 


181.5 


Sz + SA 


-0.18 


0.57 


D2S434 


192.2 


Sz 


0.01 


0.49 


D2S126 


196.3 


Sz 


-0.13 


0.56 


D2S439 


206.7 


Sz 


0.50 


0.26 


D2S206 


212.0 


Sz 


0.55 


0.24 


D2S338 


230.1 


Broad 


1.02 


0.14 


Chromosome 3 










D3S1307 


0.0 


Sz 


0.13 


0.43 


D3S1297 


2,5 


Sz 


0.06 


0.46 


D3S1304 


13.9 


Sz 


0.42 


0.30 


D3S1597 


18.4 


Sz 


0.22 


0.39 


D3S1286 


28.9 


Sz + SA 


0.23 


0.40 


D3S2396 


45.6 


Broad 


-0.53 


0.71 


D3S1768 


52.3 


Sz 


-0.10 


0.55 


D3S1611 


52.4 


Sz 


-0.11 


0.55 


D3S1300 


73.4 


Broad 


0.21 


0.41 


D3S3571 


82.3 


Broad 


0.11 


0.45 


D3S1598 


93.7 


Sz 


-0.27 


0.63 


D3S2406 


95.6 


Sz 


-0.76 


0.83 


D3S2388 


99.5 


Sz 


-0.83 


0.85 


D3S1276 


100.3 


Sz 


-0.74 


0.82 
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TABLE V. (Continued) 





l^iSlcUlCC iruiii 

pier i.Livx^ 


Pnenotype 


IN PL 


P< 


D3S2386 


100.1 


Sz 


-0.78 


0.83 


D3S1271 


103.7 


Sz 


-0.61 


0.77 


D3S1610 


113.8 


Sz 


-0.23 


0.61 


D3S1303 


119.7 


Sz 


-0.48 


0.72 


D3S1292 


131.3 


Broad 


0.76 


0.21 


D3S1569 


143.3 


Broad 


0.87 


0.18 


D3S196 


146.3 


Broad 


1.32 


0.09 


D3S1282 


161.6 


Broad 


-0.02 


0.51 


D3S1754 


177.9 


Sz + SA 


0.27 


0.38 


D3S2398 


199.1 


Sz + SA 


1.28 


0.09 


D3S1314 


201.1 


Sz + SA 


1.07 


0.13 


Chromosome 4 








D4S394 


0.0 


Sz + SA 


0.54 


0.28 


D4S1546 


18.9 


Sz + SA 


1.96 


0.02 


D3A632 


32.5 


Sz + SA 


0.31 


0.37 


D4S2382 


42.9 


Sz 


-0.33 


0.66 


D4S2379 


54.3 


Sz 


0.16 


0.42 


D4S2456 


65.3 


Sz + SA 


1.63 


0.04 


D4S400 


68.2 


Sz + SA 


1.75 


0.03 


D4S1647 


89.7 


Broad 


1.40 


0.07 


D4S1651 


103.1 


Sz 


0.68 


0.20 


D4S420 


121,0 


Broad 


-0.12 


0.55 


D4S1625 


124.7 


Sz + SA 


-0.02 


0.50 


D4S413 


136.5 


Broad 


0.32 


0.36 


D4S2368 


143.7 


Broad 


0.40 


0.33 


D4S415 


154.8 


Broad 


1.20 


0.10 


D4S408 


168.1 


Broad 


1.57 


0.05 


Chromosome 5 








D5S406 


0.0 


Sz + SA 


2.18 


0.01 


D5S807 


9.9 


Broad 


1.49 


0.06 


D5S416 


19.9 


Broad 


0.68 


0.23 


D5S268 


23.9 


Broad 


0.46 


0.31 


D5S477 


37.4 


Broad 


-0.23 


0.59 


D5S407 


56.9 


Sz 


-0.04 


0.52 


D5S424 


78.3 


Sz + SA 


0.39 


0.33 


D5S401 


95.0 


Sz + SA 


1.09 


0.12 


D5S433 


109.8 


Sz + SA 


1.60 


0.05 


D5S804 


126.8 


Sz 


-0.19 


0.59 


D5S399 


135.8 


Sz + SA 


0.19 


0.42 


FGFA 


141.3 


Sz + SA 


0.05 


0.48 


D5S2112 


156.8 


Sz + SA 


-0.75 


0.79 


D5S1475 


170.1 


Sz 


0.24 


0.38 


D5S429 


183.1 


Sz 


0.61 


0.22 


D5S498 


191.0 


Sz 


0.80 


0.16 


Chromosome 6 










D6S344 


0.0 


Sz 


-0.16 


0.57 


D6S277 


15.1 


Sz 


-0.63 


0.78 


D6S1006 


21.2 


Sz 


-0.96 


0.88 


D6S399 


21.3 


Sz 


-0.95 


0.88 


D6S422 


35.9 


Broad 


0.43 


0.32 


D6S291 


50.1 


Sz 


0.19 


0.40 


D6S426 


57.4 


Sz 


0.43 


0.29 


D6S1018 


64.1 


Sz 


0.59 


0.23 


D6S493 


80.7 


Sz 


0.28 


0.36 


D6S501 


96.0 


Sz 


-0,22 


0.61 


D6S474 


111.3 


Broad 


0.48 


0.30 


D6S1009 


130.7 


Sz + SA 


1.34 


0.08 


D6S495 


146.0 


Sz + SA 


0.67 


0.24 


D6S1008 


162.1 


Sz + SA 


0,82 


0.19 


D6S281 


180.4 


Sz 


-0.69 


0.80 


Chromosome 7 








D7S513 


0.0 


Sz 


-0.06 


0.52 


D7S493 


13.2 


Sz 


-0.51 


0.73 


D7S484 


30.0 


Sz 


0.10 


0.44 


D7S645 


52.1 


Sz + SA 


-0.29 


0.62 


D7S1511 


52.9 


Broad 


-0.07 


0.52 


D7S1797 


61.6 


Broad 


-0.64 


0.75 


D7S524 


66.5 


Sz 


-1.27 


0.95 


D7S1789 


71.2 


Broad 


-0.57 


0.72 
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TABLE V. All Loci Examined and Corresponding Highest 
Multipoint NPL Score Obtained 



Marker 


Distance from 
pter (cM) 


Phenotype 


NPL 


P< 


D7S518 


OV.D 


Droaa 


U.OO 




0.18 


X)7S677 


Q9 7 


xsroaci 


1.49 


0.06 




1 


oroao 


1.36 


0.08 


D7S530 


VO.iJ 


Broad 


1.18 


0.11 




lUO.tJ 


xtroaa 


U.o7 


0.18 


D7S794 


1 Ifl 7 
X 10. 4 


Broad 


1.10 


0.12 


U 1 oxou / 




OZ 


0.57 


0.24 


^^iirumusoine o 










u.u 


rtroaa 


0.47 


0.31 




1 / .0 


bZ 


-0.39 


0.68 




01 0 


oz 


-0.42 


0.69 




OO.V 


OZ 


1.20 


0.07 


D8S2fl2 


00.0 


OZ 


1.19 


0.07 


UOiLj i. XUrr 


Oo.D 


OZ 


0.77 


0.17 




fli 7 
DX. / 


OZ 


0.78 


0.17 




R7 ft 


oZ 


1.33 


0.05 




fl9 R 


OZ 


0.68 


0.20 




1 flQ K 
XUO.O 


Droaa 


1.58 


0.05 




1 1 Q fl 
llO.O 


oroad 


1.03 


0.14 




1 9*1 a 


OZ 


n DO 
U.9d 


0.13 




1 /t9 A 


Grr 

OZ 


0.53 


0.25 




1 ft9 rt 


G^ 
OZ 


0.31 


0.35 


Chromosonie 9 










u.u 


tsroaa 


1.16 


0.11 




lo.y 


Broad 


0.77 


0.21 




28.1 


Broad 


0.34 


0.35 




OQ Q 


ISroad 


0.43 


0.32 




>in 0 
4u.y 


rtroad 


0.93 


0.16 


nqcqni 


40.0 


ttroaa 


1.19 


0.10 




OD. / 


G»» _i CA 
OZ + oA 


0.55 


0.28 




fifl 9 
DO.Z 


rtroaa 


1.06 


0.13 


DQS278 
uu(y£i 1 o 


71 A 
4 X.4 


Broad 


1.02 


0.14 


DQSl 7fi 


7Q ft 


Q„ , GA 
OZ + oA 


0.78 


0.20 


nQQ9QQ 


fift Q 
00. 0 


G™ I G A 

OZ + oA 


1.02 


0.14 




1 Aft 
XUO.D 


OZ 


0.26 


0.37 




118 7 
XXo. / 


G-F 

OZ 


n 00 
U.OO 


0.15 




1 '^A. 
Xu4.u 


G<» 
OZ 


0.61 


0.22 


Olironiosome 10 










1)10551919 


u.u 


G"* 
OZ 


0.63 


0.21 




U.l 


o~ 
OZ 


0.66 


0.20 


UXUOX #£7 


ICQ 


OZ 


0.80 


0.16 


LIIUOD4/ 


24.9 


Broad 


1.23 


0.10 


JJ1U01Z14 


29.1 


Broad 


1.27 


0.09 


01 flOl Q1 
UlUolbi 


35.8 


Broad 


1.37 


0.08 


JJlUODli 


52.1 


Sz 


0.74 


0.18 


JJlUobUl 


55.6 


Sz 


0.89 


0.13 


TM ftQl j<oe 


56.0 


Sz 


0.82 


0.15 




66.0 


Sz 


1.27 


0.06 


ni fiCi 01 1 
iJlUolZil 


77.7 


Sz + SA 


1.39 


0.07 


UlUoDo/ 


83.9 


Sz + SA 


1.30 


0.08 




o7.b 


Sz + SA 


1.10 


0.12 


UlUoZoZ / 


94.7 


Sz + SA 


1.62 


0.04 


uxuoo / / 


lio.o 


Sz + SA 


2.08 


0.01 




122.1 


Broad 


1.45 


0.06 


JJIUOO /U 


122.2 


Broad 


1.44 


0.07 


D10S610 


134.1 


Broad 


0.66 


0.24 


D10S587 


148.4 


Broad 


-0.23 


0.59 


D10S1655 


165.6 


Sz 


-0.31 


0.65 


Chromosome 11 








D11S1984 


0.0 


Sz 


0.82 


0.16 


D11S1999 


16.5 


Sz 


0.74 


0.18 


D11S419 


25.3 


Sz 


0.53 


0.26 


D11S902 


28.2 


Sz 


0.38 


0.32 


D11S1888 


29.1 


Sz 


0.35 


0.33 


D11S1322 


47.1 


Sz + SA 


1.04 


0,14 


D11S1301 


48.0 


Sz + SA 


1.07 


0.13 


D11S1968 


48.1 


Sz + SA 


1.06 


0.13 


D11S4200 


49.5 


Sz + SA 


0.62 


0.25 



TABLE V. (Continued) 



Marker 


Distance from 
pter (cM) 




MPT 


r< 


D11S1393 


59.0 


Sz 


0.89 


0.14 


D11S1385 


61.8 


Sz 


0.54 


0.25 


D11S1335 


62.2 


Sz 


0.54 


0.25 


D11S1298 


62.3 


Sz 


0.54 


0.25 


D11S2002 


78.2 


Sz 


1.59 


0,03 


D11S1367 


84.5 


Broad 


1.35 


0.08 


D11S35 


92.3 


Broad 


1.21 


0.10 


D11S1391 


101.7 


Broad 


0.88 


0,18 


D11S924 


114.7 


Broad 


-0.01 


0.50 


D11S925 


117.6 


Sz + SA 


0.11 


0.45 


D11S4150 


135.1 


Sz 


1.09 


0.09 


D11S968 


149.2 


Sz 


0.93 


0.13 


Chromosome 12 








D12S1725 


0.0 


Broad 


0.44 


0.32 


D12S77 


10.7 


Sz 


0.86 


0.14 


D12S364 


18.8 


Broad 


1.82 


0.03 


D12S373 


21.1 


Broad 


1.87 


0.03 


D12S87 


37.3 


Broad 


1.95 


0.02 


D12S85 


45.9 


Sz + SA 


2.01 


0.02 


D12S297 


53.1 


Sz + SA 


0,93 


0.16 


D12S398 


54.1 


Sz + SA 


0.71 


0,22 


D12S83 


59.5 


Sz + SA 


1.05 


0.13 


D12S375 


69.7 


Sz + SA 


1.18 


0.11 


D12S1717 


86.1 


Broad 


0.38 


0.34 


PAH 


98.3 


Sz 


0.40 


0.30 


D12S1074 


99.7 


Sz 


0.30 


0.35 


D12S1091 


104.0 


Sz 


-0.06 


0.53 


D12S79 


116.4 


Broad 


1.08 


0.13 


D12S378 


127.9 


Broad 


1.39 


0.07 


D12S342 


131.4 


Broad 


1.57 


0.05 


D12S343 


151,4 


Broad 


1.08 


0.13 


Chromosome 13 






D13S283 


0.0 


Sz + SA 


0.77 


0.21 


D131293 


16.5 


Sz 


1.80 


0.01 


D13S220 


18.2 


Sz 


1.72 


0.02 


D13S325 


27.3 


Sz 


1.56 


0.03 


D13S328 


33.6 


Sz 


1.25 


0.06 


D13S168 


35.6 


Sz 


1.51 


0.03 


D13S318 


53.5 


Sz + SA 


0.49 


0.30 


D13S170 


65.8 


Sz + SA 


1.83 


0,03 


D13S317 


67.4 


Sz + SA 


1.63 


0.04 


D13S762 


74.0 


Sz + SA 


1.34 


0.08 


D13S158 


91.8 


Broad 


0.60 


0.26 


D13S173 


101.6 


Broad 


1.10 


0,12 


D13S1315 


108.4 


Broad 


1.69 


0.04 


D13S285 


116.4 


Broad 


1.59 


0.05 


D13S293 


127.2 


Broad 


0,78 


0.20 


Chromosome 14 








D14S50 


0.0 


Sz + SA 


1.64 


0.04 


D14S264 


10.1 


Sz + SA 


0.76 


0.21 


D14S70 


24.8 


Sz + SA 


0.20 


0.41 


D14S278 


32.7 


Sz + SA 


0.78 


0.20 


D14S269 


39.3 


Sz + SA 


0.95 


0.15 


D14S290 


51.0 


Sz + SA 


1,99 


0.02 


D14S1012 


54.4 


Sz + SA 


1.71 


0,04 


D14S74 


72.1 


Sz + SA 


1.31 


0.08 


D14S606 


75.9 


Sz + SA 


1.07 


0.13 


U14bzo0 


86.9 


Sz 


0.53 


0.25 


D14S302 


94.1 


Sz 


0.55 


0.24 


D14S267 


108.2 


Sz 


1.44 


0.04 


D14S1010 


120.5 


Sz 


0.94 


0.12 


Chromosome 15 








D15S128 


0.0 


Sz 


0.25 


0.37 


D15S144 


17.5 


Sz 


0.89 


0.13 


D15S641 


26.3 


Sz 


0.92 


0.12 


D15S659 


36.5 


Sz 


0.78 


0.16 


D15S148 


46.2 


Sz 


1.02 


0.10 


D15S159 


56.4 


Sz 


0.57 


0.24 


D15S131 


64.2 


Sz 


0.55 


0,24 



TABLE V. All Loci Examined and Corresponding Highest 
Multipoint NPL Score Obtained 
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Marker 



Distance from 
pter CcM) 



D15S152 

D15S130 

D15S207 

D15S642 

Chromosome 16 

D16S407 

D16S748 

D16S500 

D16S417 

D16S401 

D16S753 

D16S408 

D16S320 

D16S421 

D16S752 

D16S750 

D16S3037 

Chromosome 17 

D17S1828 

D17S1303 

D17S969 

D17S799 

D17S250 

D17S808 

D17S948 

D17S1291 

D17S968 

D17S784 

Chromosome 18 

D18S452 

D18S53 

D18D847 

D18S548 

D18S487 

D18S849 

D18S68 

D18S541 

D18S1161 

Chromosome 19 

D19S878 

D19S177 

D19S406 

D19S391 

D19S432 

D19S434 

D19S433 

D19S225 

D19S412 

D19S601 

D19S418 

D19S254 

Chromosome 20 

D20S889 

D20S189 

D20S471 

D20S200 

D20S119 

D20S476 

D20S171 

Chromosome 21 

D21S1431 

D21S265 

D21S1435 

D21S167 

D21S1261 

Chromosome 22 

F8VWFP 

D22S446 



Phenotype NPL P< 



Marker 



71.9 
93.3 
101.2 
121.9 

0.0 
6.1 
10.2 
23.5 
26.0 
36.8 
48.3 
52.0 
62.7 
65.0 
83.9 
100.7 

0.0 
27.1 
29.2 
32.4 
55.9 
80.0 
82.6 
89.1 
106.4 
122.3 

0.0 
16.1 
27.2 
36.8 
47.7 
54.7 
64.9 
77.0 
83.0 

0.0 

12.7 

20.5 

25.7 

38.2 

48.6 

51.2 

56.4 

71.9 

88.5 
100.3 
108.8 

0.0 
24.7 
41.1 
52.5 
67.2 
79.8 
101.7 

0.0 
13.7 
16.6 
33.5 
49.5 

0.0 
13.8 



Sz 

Broad 
Broad 
Broad 

Broad 
Sz + SA 
Sz 

Sz + SA 
Sz 

Sz + SA 

Sz 

Sz 

Sz 

Sz 

Sz + SA 
Sz 

Broad 

Sz 

Sz 

Sz 

Sz 

Sz 

Sz 

Sz 

Sz 

Broad 

Sz 
Sz 
Sz 
Sz 
Sz 
Sz 
Sz 
Sz 
Sz 

Sz + SA 

Sz + SA 

Sz + SA 

Sz + SA 

Broad 

Sz 

Sz 

Sz 

Sz 

Sz 

Sz 

Sz 

Broad 
Sz 
Sz 
Sz 

Sz + SA 
Sz 

Broad 

Sz + SA 
Sz + SA 
Broad 
Sz + SA 
Broad 

Sz + SA 
Sz + SA 



0.17 
0.25 
0.16 
0.99 

-0.61 
-0.64 
-0.98 
-0.09 
0.02 
0.61 
1.17 
1.27 
2.09 
1.83 
0.25 
0.52 

-0.17 
0.01 
0.39 
0.74 
0.17 
0.06 
0.20 
0.30 

-0.15 
0.55 

0.71 
1.00 
0.25 
0.31 
0.23 
0.47 
0.32 
-0.28 
-0.19 

0.35 

0.44 

1.26 

0.80 
-0.29 
-1.00 
-0.62 
-0.64 

0.42 

0.80 

0.77 

0.53 

-0.22 
0.66 
0.32 
0.35 
0.67 
1.03 
0.18 

0.04 

0.36 

0.48 
-0.28 
-0,47 



0.41 
0.39 
0.43 
0.15 

0.74 
0.75 
0.89 
0.54 
0.49 
0.26 
0.07 
0.06 
0.01 
0.01 
0.39 
0.26 

0.57 
0.49 
0.31 
0.18 
0.41 
0.47 
0.40 
0.35 
0.57 
0.28 



0.19 
0,11 
0.38 
0.35 
0.38 
0.28 
0.34 
0.63 
0.59 

0.35 
0.32 
0.09 
0.20 
0.62 
0.89 
0.78 
0.79 
0.30 
0.16 
0.17 
0.25 

0.59 
0.20 
0.34 
0.33 
0.24 
0.10 
0.42 

0.48 
0.35 
0.30 
0.61 
0.69 



D22S275 

D22S278 

D22S283 

D22S445 

D22S274 

X chromosome 

DXS8051 

DXS207 

DMD 

MAOB 

DXS1055 

DXS991 

DXS6786 

DXS990 

DXS1191 

DXS1220 

DXS1047 

DXS1227 

DXS1123 



Distance from 
pter (cM) 



Phenotype NPL P< 



31.7 
39.6 
42.0 
46.8 
57.1 

0.0 
17.2 
44.2 
57.9 
66.1 
74.5 
90.5 
92.9 
101.6 
107.6 
139.1 
149.8 
168.2 



Sz 
Sz 
Sz 
Sz 

Sz + SA 
Sz 

Sz + SA 
Sz + SA 
Sz + SA 
Broad 
Broad 
Broad 
Sz + SA 
Sz + SA 
Sz + SA 
Sz + SA 
Sz + SA 
Broad 



1.22 
1.35 
1.64 
0.93 
0.88 

0.12 
0.09 
0.34 
0.24 
0.15 
0.58 
0.43 
0.48 
0.54 
0.78 
0.93 
0.59 
1.34 



0.06 
0.05 
0.02 
0.12 
0.17 

0.44 
0.46 
0.36 
0.40 
0.43 
0.27 
0.32 
0.30 
0.28 
0.20 
0.16 
0.26 
0.08 



DISCUSSION 



1.78 
2.16 



0.03 
0.01 



The results from this initial systematic genome-wide 
n^"^'^'^^ with schizophrenia do not provide 
statistically significant evidence for a major genetic lo- 
cus, but weak positive lod scores were present within 
as many as 12 chromosomal regions (Iq22-q32 2d14- 

f^?i3«/4i5. '^''^». 

A possible linkage region on chromosome Iq22-q32 
l^il^^o^^"^^** evidence from markers D1S1599 and 
JL>1J>196, located approximately 23 cM apart (HLOD = 
2.4; alpha = 0.48). No other investigative groups are 
known to have positive findings in this region 

On chromosome 2pl4-12, marker D2S1337 shows a 

nt^^'ifo?\ = <> 01> and maximum 

HLOD of 2.2 (alpha = 0.41). Moises et al. [1995a] re- 
ported hnkage to D2S135 (P = l.l x 10-«) located in 
2ql2-ql4, approximately 30 cM away from the region 
detected in this scan, and Byerley et al. (unpubUshed 
data) have lod scores suggestive of Unkage within this 
region in a cohort of families with schizophrenia from 
the island of Palau. 

Another region of potential interest is located on 
chromosome 4; positive markers span an approximate 
fa^'flf^'''- °4S1546 (18.9 cM from pter), D4S2456 
065.4 cM from pter), and D4S400 (68.3 cM from pter) 
Ihe nonparametric multipoint curve for chromosome 4 
indicates two distinct peak regions. The genome scans 
of Moises et al. [1995a] and Williams et al. [1997] re- 
sulted in positive scores in this region 

Straub et al. [1997] recently reported evidence for a 
schizophrenia susceptibility locus within 5q22-q31 
One marker from our present study, D5S433. within 
? M''of7i" of 1.60, P < 0.05. Wildenauer et 

al. [1997] also provided positive data in this region 
l\vo groups have reported evidence for a locus within 
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Fig. 1. Multipoint nonparametric linkage analysis of the genome. Three hundred and thirty-eight markers spanning the entire genome were analyzed 
LTL"^n n V"r^T*u' t GENEHUNTER. Cumulative map distances in centimorgans sorting fh)mTe fosT ptermS 

IS ! T ^"^ M^ltipomt NPL scores totaled over all pedigrees are given on each Y-axis. The resulting multipoint curve using the schizo- 
phrema-only phenotype is shown in blue, the schizophrema plus schizoaffective disorder phenotype is shown in red. and the broad diagnosis phenotype 
IB snown m green. 
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Fig. 2. Information content curve for each chromosome. The information content (Y-axis) at each marker and in five equally spaced intervals between 
LnLr^O 52 w!^^^^^^^ ^"^-^ GENEHUNTER The average information content for all autosomal markers wis ^67^ lowes^^^^^^ 

Zmt^r:^.^m ^ °" chromosome 21, and the highest information content (0.78) occurred in proximal UqTdTun^ 
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8p22-21 [Kendler et al., 1996; Pulver et al, 1995]. One 
marker in this genome scan was positive in this region 
(D8S560, Z„3^ = 1.99, theta of 0.16 with recessive in- 
heritance), and similarly, the NIMH Collaborative Ini- 
tiative for schizophrenia [1997] also noted that this re- 
gion was of possible interest. 

The chromosome 13ql2-14 region also provided posi- 
tive results, specifically with markers D13S1293 (dis- 
tance from pter, 16.5 cM; schizophrenia-only diagnosis) 
and D13S168 (distance from pter, 35.6 cM; broad diag- 
nosis). Both nonparametric and parametric scores pro- 
duced the highest peaks with a dominant model (Z„^ 
- 2.85, theta = 0.10). In addition, Lin et al. [1995] 
reported suggestive evidence for Hnkage to a locus in 
13ql4.1-q32, while Antonarakis et al. [1996] reported 
a positive finding at a region in 13q32. One marker in 
our study, D13S170, is approximately 15 cM from the 
reported region in 13q32 and shows shght evidence for 
linkage using nonparametric analysis (NPL = 1 83 P 
< 0.03). 

A possible locus on chromosome 16q 13-22 is sup- 
ported by results from D16S320 and D16S421, located 
within an approximate 12-cM region. All markers in 
this region of chromosome 16 support a locus using the 
schizophrenia phenotype and a recessive mode of in- 
heritance when using parametric analysis. A previous 
study which contained two overlapping families with 
the present study [Polymeropoulos et al., 1995] also 
showed evidence of linkage within this region (sib-pair 
analysis chi-square - 9.76, P < 0.0009) at D16S266. 

One of the more interesting positive findings in this 
search is on chromosome 22 with D22S446 (distance 
from pter, 13.8 cM) and D22S283 (distance from pter, 
42.1 cM). It appears from the nonparametric multi- 
point analysis that these two linkage regions may be 
distinct. Carlson et al. [1997] defined a 3-Mb region in 
22qll that is commonly deleted in some psychiatric 
patients with velo-cardio-facial syndrome (VCFS). 
D22S426 is within 1 cM of the region defined by Carl- 
son et al. [1997], Several investigators found evidence 
for hnkage to another region spanning approximately 
20 cM in 22ql2-ql3 [Pulver et al., 1994a; Coon et al 
1994a; Gill et al., 1996; Moises et al., 1995b], which is 
the same region detected by Marker D22S283. Some 
groups have subsequently failed to replicate these find- 
ings in separate studies [Kalsi et al., 1995; Pulver et 
al., 1994b; WiUiams et al., 1997; Parsian et al., 1997]. 
Our own previous study of this region [Polymeropoulos 
et al., 1995] was essentially negative. Ten of the 104 
famihes from the study of Polymeropoulos et al. [1995] 
overlapped with those of the present study. Analysis 
without these famihes did not make a difference in any 
of the reported chromosome 22 results. 

With regard to the X chromosome, this genome-wide 
scan fails to support our previously reported weak posi- 
tive findings on proximal Xp [DeLisi et al., 1994; Dann 
et al., 1997]. However, the power available in 70 fami- 
hes may not be sufficient to detect hnkage to X-Y ho- 
mologous genes. 

In summary, there are several regions that need 
closer examination based on the results of the present 
genome screen for a schizophrenia susceptibility locus 
(or loci). These are within chromosomes 1, 2, 4, 5, 8, 13, 



16, and 22, and some have received fiirther support 
trom other independent genome-wide screens. Never- 
theless, none of these regions reach conventional levels 
for statistical evidence of linkage, as proposed by 
Lander and Kruglyak [1995], whose guidehnes for "sig- 
mficant hnkage" in genome-wide scans indicate that a 
P value of less than 2 x 10"^ or a lod score greater than 
3.6 IS needed. Thus, it must be considered that all find- 
ings in this genome scan could be the result of chance, 
and that none of the loci discussed actually represent 
true schizophrenia susceptibihty loci. This possibiHty 
IS increased by the fact that we have analyzed multiple 
phenotype classifications, which further influence the 
type I error rate. 

One interpretation of the results from this study and 
previous studies reported to date is that one major lo- 
cus IS unlikely to exist for schizophrenia, since many 
families have already been screened with no lod score 
reaching conventional levels of significance. However 
if a gene of minor effect is to be found, the power froni 
70 predominantly nuclear famihes is too low here for 
significant results to be hkely to appear (see Table III) 
Sawcer et al. [1997] suggested that the guidelines of 
Lander and Kruglyak [1995] are too stringent for ge- 
nome scans searching for complex disease genes, such 
as schizophrenia. 

Regardless of one's definition of 'linkage" and wheth- 
er there IS one or several genes involved, this genome- 
wide search has provided a few regions worth further 
investigation. The present study is now being extended 
in the laboratory to densely cover these regions with 
more markers in approximately 1-3-cM intervals and 
to expand the number of famihes at least 3-fold who 
will receive systematic genomic screening. Replication 
of similar findings by other independent investigative 
groups will also aid in vahdating the positive scores 
from the present study. 

Finally, the controversy over phenotypic character- 
istics corresponding to the genotype has not been clari- 
fied by the present study. No diagnostic inclusion cri- 
teria consistently yielded maximal scores for each sug- 
gestive locus; rather, the peak lod scores varied among 
markers for different diagnostic categories and types of 
analyses. While it is possible that different loci may 
contribute to different diagnostic subtypes, it is doubt- 
ful that this dilemma can be solved until a definite 
hnkage to a locus with an associated mutant variant 
has been determined. Families for the present initial 
study were chosen conservatively so that at least one 
member of a sib-pair had a nonaffective form of schizo- 
phrenia (without any depressive or manic compo- 
nents). However, when all schizoaffective individuals 
were eliminated from analyses, the power for detecting 
hnkage significantly lowered; thus it cannot be deter- 
mined whether the reduced power or the lack of accu- 
rately estimating the phenotypic boundaries is contrib- 
uting to the variable results. 

REFERENCES 

Antonar^is SE. Blouin J-L, Pulver AE. Wolyniec P, Lasseter VK, Nestadt 
UI^Kasch L, Babb R, Kazazian HH, Dombroski B, Kimberland M Ott 
J, Housman D, Karayiorgou M, MacLean CJ (1995): Schizophrenia 
susceptibility and chromosome 6p24-22. Letter to the Editor. Nat 
tjenet 11:235—236. 



Antonarakifi SE, Blouin JL, Curran M, Luebbert H, Kazazian HH, Dom- 
brioski B, Housman D, Ton C, Karayiorgou M, Chakravarti A, 
Wolyniec P, Lasseter VK, Nestadt G, Pulver AE (1996): Linkage and 
sib-pair analysis reveal a potential schizophrenia susceptibility gene on 
chromosome 13q32. Psychiatr Genet 6:136. 

Barr CL, Kennedy JL, Pakstis L, Wetterberg L» Sjogren B, Bierut L, 
Wadelius C, Wahlstrom J, Martinsson T, Giuffra L, Gelemter j\ 
Hallmeyer J, Moises HW, Kurth J, CavalU-Sforza LL, Kidd KK(1994): 
Progress in a genome scan for linkage in schizophrenia in a large Swed- 
ish kindred. Am J Med Genet 54:51-58. 

Byerley W, Polloi A, Dale P, HoffM, Rosenthal J, Tiobech J, Reimherr F, 
Paul W, Myles-Worsley M, Coon H (1997): Genome-wide linkage analy- 
sis of a large schizophrenia pedigree ascertained from Palau, Microne- 
sia. Am J Med Genet 74:559-560. 

Carlson C, Papolos D, Pandita RK, Faedda G, Veit S, (5oldberg R, Shprint- 
zen R, Kucherlapati R, Morrow B (1997): Molecular analysis of velo- 
cardio-facial syndrome patients with psychiatric disorders. Am J Hum 
Genet 60:851-859. 

Coon H, Holik J, HoffM, Reimberr F, Wender P, Freedman R, Byerley W 
(1994a): Analysis of chromosome 22 markers in 9 schizophrenia pedi- 
grees. Am J Med Genet 54:72-79. 

Coon H, Jensen S, Holik J, HoffM, Myles-Worsley M, Reimherr F, Wender 
P, Waldo M, Freedman R, Leppert M, Byerley W (1994b): Genomic scan 
for genes predisposing to schizophrenia. Am J Med (3enet 54:59-71. 

Cooperative Human Linkage Center (1994): A comprehensive human link- 
age map with centimorgan density. Science 265:2049-2054. 

Cottingham R, Idury R, Schaffer A (1993): Faster sequential genetic link- 
age computations. Am J Hum Genet 53:252-263. 

Crowe RR, Black DW, Wesner R, Andreasen NC, Cookman A, Roby J 
(1991): Lack of linkage to chromosome 5qll-ql3 markers in six schizo- 
phrenia pedigrees. Arch Gen Psychiatry 48:351-^61. 

Dann J, DeLisi LE, Devoto M, Laval S, Nancarrow DJ, Shields G, Smith A, 
Loftus J, Peterson P, Vita A, Comazzi M, Invemizzi G, Levinson DF, 
Wildenauer D, Mowry BJ, ColUer D, Powell J, Crowe RR, Andreasen 
NC, Silverman JM, Mohs RC, Murray RM, Walters MK, Lennon DP, 
Hayward NK, Albus M, Lerer B, Maier W, Crow TJ (1997): A linkage 
study of schizophrenia to markers within Xpll near the MAOB gene. 
Psychiatry Res 70:131-143. 

DeLisi LE, Crow TJ ( 1989): Evidence for a sex chromosome locus for schizo- 
phrenia. Schizophr Bull 15:431-440. 

DeLisi LE, Devoto M, Lofthouse R, Poulter M, Smith A, Shields G, Bass N, 
Chen G, Vita A, Morganti C, Ott J, Crow TJ (1994): Search for linkage 
to schizophrenia on the X and Y chromosomes. Am J Med Genet 54: 
113-121. 

DeLisi LE, Kelly M, Shaw S, Cardon L, Smith AB, Hopkins P, Shields G, 
Loftus L, Laval S, Crow TJ, Sherrington R (1997): Preliminary results 
of an international genomic scan for schizophrenia. Am J Med Glenet 
74:558. Presented at the Fifth World Congress of Psychiatric (^netics 
October 20, 1997. 

Dib CS, Faure S, Fizames C, Samson D, Dourot N, Vignal A, Millasseau P, 
Marc S, Hazan J, Seboun E, Lathrop M, Gyapay G, Morissette J, Weis- 
senbach J (1996): A comprehensive genetic map of the human genome 
based on 5,264 microsatelUtes. Nature 380:152-154. 

Gamer C, Kelly M, Cardon L, Joslyn G, Carey A, LeDuc C, Lichter J, 
Harris T, Loftus J, Shields G, Comazzi M, Vita A, Smith AM, Dann J, 
Crow TJ, DeLisi LE (1996): Linkage analyses of schizophrenia to chro- 
mosome 6p24-p22: An attempt to replicate. Am J Med Genet 67:595- 
610. 

Gill M, Vallada H, Collier D, et al. (1996): A combined analysis lod 
D22S278 marker alleles in affected sib-pairs: Support for a suscepti- 
bility locus for schizophrenia at chromosome 22ql2. Schizophrenia (Col- 
laborative Linkage Group (Chromosome 22). Am J Med Genet 1:40-45. 

Curling H, Kalsi G, Chen AH-S, Green M, Butler R, Read T, Murphy P 
(1995): Schizophrenia susceptibility and chromosome 6p24-22. Letter 
to the Editor. Nat Genet 11:234-235. 

Hall JM, LeDuc CA, Watson AR, Roter AH (1996): An approach to high- 
throughput genotyping. Genome Res 6:781-790. 

Horvatta I, Varilo T, Suvisaari J, Vaisanen JD, Terwilliger J, Lonnqvist J, 
Peltonen L (1997): A genome-wide search for schizophrenia genes in an 
internal isolate in Finland. Am J Med Genet 74:559. 

Kalsi G, Bryiyolfsson J, Butler R, et al. (1995): Linkage analysis of chro- 
mosome 22ql2-Kil3 in a United Kingdom/Icelandic sample of 23 mul- 
tiplex schizophrenia families. Am J Med Genet 60:298-301. 

Kendler KS (1988): The genetics of schizophrenia: An overview. In Tsuang 
MT, Simpson JC (eds): "Handbook of Schizophrenia, Volume 3, Nosol- 



Genome-Wide Search for Schizophrenia 375 

ogy, Epidemiology and Genetics of Schizophrenia." Amsterdam: 
Elsevier, pp 437-462. 

Kendler KS, McGuire M, Gruenberg AM, Spellman M, OTiare A, Walsh D 
(1993): The Rosconunon family study. 11. The risk of non-schizophrenic 
nonaffective psychoses in relatives. Arch Gen Psychiatry 50:645-652. 

Kendler KS, Neale MC, Walsh D (1995): Evaluating the spectrum concept 
of schizophrenia in the Roscommon family study. Am J Psychiatry 
152:74^-754. ^ y 

Kendler KS, MacLean CJ, O'Neill FA, Burke J, Murphy B, Duke F, Shink- 
win R, Easter SM, Webb BT, Zhang J, Walsh D, Straub RE (1996): 
Evidence for a schizophrenia vubierability locus on chromosome 8p in 
the Irish study of high-density schizophrenia families. Am J Psychiatry 
153:1534-1540. 

Kety SS, Wender PH, Jacobsen B, Ingraham LJ, Jansson L, Faber B, 
ICinney DK (1994): Mental illness in the biological and adoptive rela- 
tives of schizophrenic adoptees. Replication of the Copenhagen study in 
the rest of Denmark. Arch Gen Psychiatry 51:442-455. 

Kruglyak L, Daly MJ, Reeve-Daly MP, Lander ES (1996): Parametric and 
nonparametric linkage analysis: A unified multipoint approach. Am J 
Hum Genet 58:1347-1363. 

Lander E, Kruglyak L (1995): Genetic dissection of complex traits: Guide- 
Unes for interpreting and reporting linkage results. Nat Genet 11-241- 
247. 

Lathrop GM, Lalouel JM, JuHer C, Ott J (1984): Strategies for multilocus 
Unkage analysis in humans. Proc Nati Acad Sd USA 81:3443-3446. 

Levinson DF, Mahtani MM, Brown D, Nancarrow DJ, Kirby A, Daly M, 
Crowe RR, Andreasen N, SUverman JM, Mohs RC, Endicott J, Sharpe 
L, Walters MK, Lennon DP, Hayward NK, Kruglyak L, Mowry BJ 
(1996): As genome scan of schizophrenia. Presented at the 1996 World 
Congress on Psychiatric (^netics, San Francisco, California, October 
28, 1996. Psychiatr Genet 6:140. 

Lin MW, Curtis D, Williams N, et al. (1995): Suggestive evidence for link- 
age of schizophrenia to markers on chromosome 13ql4.1-q32. Psychi- 
atr Genet 3:117-126. 

Matise TC, Perlin M, Chakravarti A (1994): Automated construction of 
genetic linkage maps using an expert system (MultiMap): A human 
genome linkage map. Nat Genet 6:384-390. 

McGuffin P, Farmer AE, Ciottesman 11, Murray RM, Reveley AM (1984): 
Twin concordance for operationally defined schizophrenia. Confirma- 
tion of famiUarity and heritability. Arch Gen Psychiatry 41:541-545. 

Moises HW, Yang L, Kristbjamarson H, Wiese C, Byerley W, Macciardi F, 
Arolt V, Blackwood D, Liu X, Sjogren B, Aschauer HN, Hwu H-G, Jang 
K, Livesley WJ, Kennedy JL, Zoega T, Ivarsson O, Bui M-T, Yu M-H, 
Havsteen B, Commenges D, Weissenbach J, Schwinger E, Gottesman 
II, Pakstis AJ, Wetterberg L, Kidd KK, Helgason T (1995a): An inter- 
national two-stage genome-wide search for schizophrenia susceptibility 
genes. Nat Genet 11:321-324. 

Moises HW, Yang L, Li T, Havsteen B, Fimmers R, Baur MP, Liu X, 
C^ttesman II (1995b): Potential linkage disequilibrium between schizo- 
phrenia and locus D22S278 on the long arm of chromosome 22 Am J 
Med Genet 5:465-467. 

Mowry BJ, Nancarrow DJ, Lennon DP, Sandkuyl LA, Crowe RR, Silver- 
man JM, Mohs RC, Siever LJ, Endicott J, Sharpe L, Walters MK, 
Hayward NK, Levinson DF (1995): Schizophrenia susceptibility and 
chromosome 6p24-22. Letter to the Editor. Nat <3enet 11:233-234. 

NIMH CSenetics Initiative (1997): A genome-wide search for schizophrenia 
susceptibiUty loci: The NIMH Genetics Initiative and Millennium Con- 
sortium. Presented by S. Faraone at the Fifth World Congress of Psy- 
chiatric CSenetics, Santa Fe, New Mexico, October 20, 1997. Am J Med 
Genet 74:556-557. 

Numberger JI, Blehar MC, Kaufiman CA, York-Cooler C, Simpson SG, 
Harkayy-Friedman J, Severe JB, Malaspina D, Reich T (1994): Diag- 
nostic interview for genetic studies: Rationale, unique features and 
training. Arch Gen Psychiatry 51:849-862. 

Ott J (1991): "Analysis of Hiunan Linkage." Baltimore: Johns Hopkins 
University Press. 

Parsian A, Suarez BK, Isenberg K, Hampe CL, Fisher L, Chakraverty S, 
Meszaros K, Lenzinger E, WilUnger U, Fuchs K, Aschauer HN, Clon- 
inger CR (1997): No evidence for a schizophrenia susceptibility gene in 
the vicinity of IL2RB on chromosome 22. Am J Med Genet 74:361-364. 

Pfohl B, Blum N, Zimmerman M, Stangl D (1990): "Structured interview 
for DSM-III-R Personality Disorders-Revised (SIDP-R)." Department 
of Psychiatry, University of Iowa. 

Polymeropoulos MH, Coon H, Byeriey W, Gershon ES, (Soldin L, Crow TJ, 
Rubenstein J, Hoff H, Holik J, Smith AM, Shields G, Bass NJ, Poulter 



376 Shaw et al. 



M, Lofthouse R, Vita A, Morganti C, Merril CR, DeLisi LE (1994): 
Search for a schizophrenia susceptibiUty locus on human chromosome 
22. Am J Med Genet 54:93-99. 

Polymeropoulos MH, Devoto M, Rubenstein J, Lofthouse R, Smith A, 
Shields G, Bass N, Crow TJ, DeLisi LE (1995): A random search of the 
genome in nuclear families with schizophrenia. Psychiatr Genet 5:51. 

Pulver AE, Karayiorgou M, Wolyniec PS, Lasseter VK. Kasch L, Nestadt G, 
Antonarakis S, Housman D, Kazazian HH, Meyers D, Ott J, Lamacz M, 
Liang K-Y, Hanfelt J, Ullrich G, DeMarchi N, Ramu E, McHugh PR^ 
Adler L, Thomas M, Carpenter WT, Manschreck T, (Gordon CT, Kim- 
berland M, Babb R, Puck J, Childs B (1994a): Sequential strategy to 
identify a susceptibiUty gene for schizophrenia: Report of potential 
Unkage on chromosome 22ql2-ql3.1: Part L Am J Med Genet 54-36- 
43. 

Pulver AE, Karayiorgou M, Lasseter VK, et al. (1994b): Follow-up of a 
report of a potential linkage for schizophrenia on chromosome 22ql2- 
ql3.1: Part 2. Am J Med Genet 54:44-^50. 

Pulver AE, Lasseter VK, Kasch L, et al. (1995): Schizophrenia: A genome 
scan targets chn)mosomes 3p and 8p as potential sites of susceptibility 
genes. Am J Med Genet 60:252-260. 

Risch N (1990): Linkage strategies for genetically complex traits. IL The 
power of affected relative pairs. Am J Hum CJenet 46:229-241. 

Sawcer S, Jones HB, Judge D, Visser F, Compston A, Goodfellow PN, 
Clayton D (1997): Empirical genome-wide significance levels estab- 
lished by whole genome simulations. Genet Epidemiol 14:223-229. 

Schizophrenia Linkage (Collaborative Group for Chromosomes 3, 6, and 8 
(1996): Additional support for a schizophrenia linkage to chromosomes 
6 and 8. Am J Med CJenet 67:580-594. 

Schwab SG, Albus M, Hallmayer J, et al. (1995a): Evaluation of a suscep- 
tibility gene for schizophrenia on chromosome 6P by multipoint af- 
fected sib-pair analysis. Nat Genet 11:325-^327. 

Schwab SG, Lerer B, Albus M, Maier W, Hallmeyer J, Fimmers R, Lichter- 
mann D, Minges J, Bondy B, Ackenheil M, Altmark D, Hasib D, Gur E, 
Ebstein RP, Wildenauer DB (1995b): Potential linkage for schizophre- 
nia on chromosome 22ql2-ql3: A replication study. Am J Med Genet 
60:436-443. 

Sherrington R, Biyiyolfsson J, Petursson H, Potter M, Dudleston K, Barra- 
clough B, Wasmuth J, Dobbs M, Gurling H (1988): Localization of a 
susceptibility locus for schizophrenia on chromosome 5. Nature 336: 
164-167, 

Silverman JM, Greenberg DA, Altstiel LD, et al. (1996): Evidence of a locus 
for schizophrenia and related disorders on the short arm of chromo- 
some 5 in a large pedigree. Am J Med Genet 67:162-171. 



Spitzer RL, Endicott J (1978): "Schedule for Affective Disorders and 
Schizophrenia (SADS)/ New York: Biometrics Research Division, New 
York State Psychiatric Institute. 

Straub RE, MacLean CJ, O'Neill FA, Burke J, Murphy B. Duke F, Shink- 
win R, Webb BT, Zhang J, Walsh D, Kendler KS (1995): A potential 
vuhierability locus for schizophrenia on chromosome 6p24-22: Evi- 
dence for genetic heterogeneity. Nat Genet 11:287-293. 

Straub RE, MacLean CJ, O'Neill FA, Walsh D, Kendler KS (1997a): <3e- 
nome scan for schizophrenia genes. Presented at the Fifth World Con- 
gress of Psychiatric Genetics, Santa Fe, New Mexico, October 20 1997 
Am J Med Genet 74:558. 

Straub RE, MacLean CJ, O'Neill FA, Walsh D, Kendler KS (1997b): Sup- 
port for a possible schizophrenia vulnerability locus in a region of 
5q22-31 in Irish famiUes. Mol Psychiatry 2:148-155. 

Utah Marker Development Group (1995): A collection of ordered tetra- 
nucleotide-repeat markers from the human genome. Am J Hum Genet 
57:619-628. 

Wang S, Sun C, Walczak CA, Ziegle JS, Kipps BR, Goldin LR, Diehl SR 
(1995): Evidence for a susceptibility locus for schizophrenia on chromo- 
some 6ptei^p22. Nat Genet 10:41-46. 

Wildenauer DN (1996): Chromosome 18 and schizophrenia. Presented at 
the 1996 Annual Meeting of the ACNP, San Juan, Puerto Rico, Decem- 
ber, 1996. 

Wildenauer DB, Albus M, Schwab SG, HalUnayer J, Hansea C, Eckstein 
GN, Zill P, Honig S, Lerer B, Ebstein R, Lichtermann D, Trixler M, 
Boirmann M, Maier W (1997): Searching for susceptibiUty genes in 
schizophrenia by affected sib-pair analysis. Presented at the Fifth 
World Congress of Psychiatric Genetics, Santa Fe, New Mexico, Octo- 
ber 20, 1997. Am J Med Genet 74:558. 

Williams NM, Rees MI, Holmans P, Daniels J, Fenton I, Cardno AG, Mur- 
phy KC, Jones LA, Asherson P, McGuffin P, Owen MJ (1997): CJenome 
search for schizophrenia susceptibility genes using a two-stage sib-pair 
approach. Presented at the Fifth World Congress of Psychiatric (Genet- 
ics, Santa Fe, New Mexico, October 20, 1997. Am J Med Cienet 74:559. 

Schizophrenia Collaborative Lmkage Group {Chromosome 22) (1996): A 
combined analysis of D22S278 marker alleles in affected sib-pairs: 
Support for a susceptibiUty locus for schizophrenia at chromosome 
22ql2. Am J Med Genet 67:40-45. 

Vallada HP, Gill M, Sham P, Urn LC, Nanko S, Asherson P, Murray RM, 
McGuffin P, Owen M, ColUer D (1995): Unkage studies on chromosome 
22 in famiUal schizophrenia. Am J Med (Genet 60:139-146. 



Proc. Natl. Acad. Sci. USA 

Vol. 96, pp. 5604-5609, May 1999 

Genetics 



A high-density genome scan detects evidence for a 
bipolar-disorder susceptibility locus on 13q32 and 
other potential loci on lq32 and 18pll.2 

SEVILLA D. DETERA-WADLEIGH*t, JUDITH A. BaDNER**, WADE H. BERRETnNI§, TaKEO YoSHIKAWA* 

Lynn R. Goldin*^^, Gordon Turner*, Denise Y. Rollins*, Tracy Moses*, Alan R. Sanders**, 
Jayaprakash D. Karkera*, Lisa E. Esterling*, Jin Zeng*, Thomas N. Ferraro§, Juliet J. Guroff*, 
Diane Kazuba*, Mary E. Maxwell*, John L Nurnberger, jR.ii, and Elliot S. Gershon** 

*C!inical Nc"rogcn^^^^ Branch, National Institute of Mental Health, and 'Genetic Epidemiology Branch, National Cancer Institute, National Institutes of Health 
Bethesda, MD 20892; ^Department of Psychiatry, University of Chicago, Chicago, IL 60637; §Centcr for Neurobiology and Behavior, Department of Psychiatry ' 
University of Pennsylvania, Philadelphia, PA 19104; and "Medical and Molecular Genetics and Psychiatry, Indiana University. Indianapolis, IN 46202 

Edited by Newton E. Morton, University of Southampton, United Kingdom, and approved March 3, 1999 (received for review December 18, 1998) 



ABSTRACT Bipolar disorder is a severe mental illness 
characterized by mood swings of elation and depression. 
Family, twin, and adoption studies suggest a complex genetic 
etiology that may involve multiple susceptibility genes and an 
environmental component. To identify chromosomal loci con- 
tributing to vulnerability, we have conducted a genome-wide 
scan on «*396 individuals from 22 multiplex pedigrees by 
using 607 microsatellite markers. Multipoint nonparametric 
analysis detected the strongest evidence for linkage at 13q32 
with a maximal logarithm of odds (lod) score of 3.5 (F = 
0.000028) under a phenotype model that included bipolar I, 
bipolar II with major depression, schizoaffective disorder, and 
recurrent unipolar disorder. Suggestive linkage was found on 
Iq31-q32 (lod = 2.67; P ^ 0.00022) and 18pll.2 (lod = 2.32; 
P = 0.00054). Recent reports have linked schizophrenia to 
13q32 and 18pll.2. Our genome scan identified other inter- 
esting regions, 7q31 (lod = 2.08; P - 0.00099) and 22qll-ql3 
(lod = 2.1; P = 0.00094), and also confirmed reported linkages 
on 4pl6, 12q23-q24, and 21q22. By comprehensive screening 
of the entire genome, we detected unreported loci for bipolar 
disorder, found support for proposed linkages, and gained 
evidence for the overlap of susceptibility regions for bipolar 
disorder and schizophrenia. 



Bipolar disorder is a complex neuropsychiatric disease of 
unknown genetic etiology. Theories concerning multigenic 
inheritance involving multiple genes that exert varying mag- 
nitudes of effect on overall susceptibility have been advanced, 
but these genes have yet to be identified. Early linkage studies 
on bipolar disorder have been marked with uncertainty (1), 
possibly because of difficulties related to the power of detect- 
ing weak- to moderate-effect genes (2-4), to other confound- 
ing factors, or to spurious reported findings. Recent advances, 
which include the availability of dense maps containing highly 
informative markers, the use of extended pedigrees or larger 
sample collections, and the application of newer analytical 
methods, have propelled reports showing evidence of linkage 
to chromosomes 4pl6 (5), 12q23-q24 (6), 18pcen-qcen (7, 8), 
18q22-q23 (9), and 21q22.3 (10). 

Previous chromosomal screens on our 22 multiply affected 
pedigrees elicited evidence of linkage to chromosome 18 (7, 8) 
and support for the proposed linkage on 21q22.3 (11). In 
contrast, our earlier scans that used sparsely spaced and 
moderately informative markers have not uncovered areas of 
linkage (12, 13). In the present study, we performed a high- 
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density genome scan by using highly polymorphic markers to 
identify chromosomal regions that contribute to the genetic 
risk of bipolar disorder. Here, we report evidence for bipolar- 
disorder susceptibility loci, support of prior linkages, and 
possible overlaps of regions implicated in bipolar disorder and 
schizophrenia. 

METHODS 

Pedigrees. A panel of 22 multiplex pedigrees was used for 
the genome screening, and the diagnostic and ascertainment 
methods used for 21 of these have been described in detail 
elsewhere (14). These families included «*365 informative 
persons (i.e., those persons whose genotype can be determined 
either directly or indirectly; ref. 14). The 22nd family was the 
"right extension" of the Old Order Amish pedigree 110 (7). 

To address the issue of uncertainty regarding the spectrum 
of illness that is inherited, two hierarchical models of the 
affected phenotype were used in linkage analysis. For affection 
status model (ASM) I, the definition of the affected category 
was restricted to bipolar I, bipolar II with major depression, 
and schizoaffective disorder. For the second model, ASM II, 
the phenotype definition was broadened to include those 
individuals with two or more episodes of major depression. 
Each pedigree had a minimum of four affected individuals 
under ASM II. There were 396 individuals that were available 
for genotyping, including 117 with ASM I and 159 with ASM 
II diagnoses. 

Although we excluded branches of a pedigree when a person 
marrying into the pedigree had any of the diagnoses included 
in ASM I and ASM II, bilineality could not be ruled out, 
particularly if a multigenic model with common susceptibility 
alleles is assumed. However, when nonparametric methods are 
used, this issue may not be relevant. Bilineal families may 
reduce power but would not be expected to increase the 
incidence of false-positive results. 

The power of this pedigree series to detect linkage has been 
examined in a prior study; this study used assumed parameters, 
because the true genetic model for bipolar disorder is not 
known (14). Simulations of linkage using 21 of the 22 families 
included in the present genome scan were performed under a 
dominant transmission model and ASM II by using the simlink 
program (15). Assuming the presence of heterogeneity, there 
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was >50% power when at least 25% of the families are linked, 
for a marker with a polymorphism information content of 0.7 
at 6 = 0.01. In a multigenic model, power depends on the 
effect of the susceptibility gene and whether it is acting in a 
multiplicative or additive fashion with other predisposing 
genes. 

Genotyping. The microsatellite markers used for genotyping 
consisted of tandem repeated dinucleotides and tetranucleo- 
tides, as well as a few trinucleotides. Markers were chosen from 
the following databases: the Cooperative Human Linkage 
Center (CHLC; www.chlc.org); the CHLC/Weber screening 
set, versions 6 and 8 (Research Genetics, Huntsville, AL); 
Genethon (ftp://ftp.genethon.fr/pub/Gmap/Nature-1995 /); 
the Center for Medical Research of the Marshfield Medical 
Research Foundation (www.marshmed.org/genetics/); and 
the Genetic Location Database (ref. 16; cedar.genetics.so- 
ton.ac.uk/public_html/). A few markers were taken from the 
map created by the National Institutes of Health/Centre 
d'fitude du Polymorphisme Humain (Paris) Collaborative 
Mapping Group (mapper.wustl.edu/basemaps/index.html). 
The Marshfield map included almost all of the markers, and, 
in other instances, intermarker distance and order were de- 
duced from the pedigrees by using crimap [Green, P., Falls, K. 
& Crooks, S. (1990) crimap Documentation (Department of 
Genetics, School of Medicine, Washington University, St. 
Louis), Version 2.4; linkage.rockefeller.edu/soft/crimap/]. 

With few exceptions, PCR amplification was done by using 
one set of cycling conditions: 95°C for 5 min for the initial 
denaturation, 30 cycles of denaturation at 94°C for 30 s, 
annealing at 55°C for 30 s, and extension at 72°C for 1 min. A 
final extension of 72''C for 10 min was done. Markers that 
failed under these conditions were replaced by other markers 
mapping to the same region. The controls CEPH 1333-01 and 
1333-02 were placed in specific lanes with each set of samples 
to obtain consistent assignment of allele sizes from gel to gel. 
Most of the genotypings were radioactivity-based, and each 
autoradiogram was read by two independent readers who were 
blind to diagnosis. As size standards, we used an M13 sequence 
ladder and a 123-bp ladder. Automated genotyping was done 
with an Applied Biosystems (ABI) 373 sequencer (Perkin- 
Elmer), and alleles were scored with the genescan and 
GENOTYPER programs. Unlabeled and fluorescently labeled 
primers were purchased from Research Genetics, although a 
few were synthesized through BioServe Biotechnology (Gaith- 
ersburg, MD). Mendelian inconsistencies were resolved before 
linkage analysis, and marker allele frequencies were computed 
from individuals marrying into the pedigree. 

Linkage Analysis. Linkage was evaluated under the affected 
phenotype classifications of ASM I and ASM II. Pointwise 
affected-sib-pair (ASP) analysis was done by including all ASPs 
by using the SIBPAL program, version 3.0, in the sage package 
(17). Multipoint analyses were performed by using aspex 
[Hinds, D. & Risch, N. (1996) aspex, version 1.62, sib_phase 
package; ftp://lahmed.stanford.edu/pub/aspex] and gene- 
hunter-plus (GHP), version 1.1 (18, 19). Sex-averaged maps 
were used for multipoint analyses. For chromosome 18, sex- 
specific maps were used in additional calculations. For the GHP 
analysis, individuals that were uninformative or that were 
connected to the pedigree by more than two uninformative 
generations were deleted. The right extension of the Amish 
pedigree 110 was divided into two families. Pedigrees were 
analyzed using the all function of ghp, which examines all 
individuals simultaneously and assigns a higher score when 
more of them share the same allele by descent. The probability 
of the observed sharing is calculated by using a semiparametric 
logarithm of odds (lod) score as described (19). 

GHP analysis also was done after splitting the pedigrees into 
nuclear families (GHPfam). It has been shown through simu- 
lation that in GHP analysis, more distant relatives provide less 
information when a susceptibility allele is common (20). The 



reverse is true when the allele is rare. GHPfam was used when 
there was a sign if icant difference between GHP and aspex data. 
If the results from GHPfam are similar to GHP, then the 
differences likely were caused by the different algorithms. On 
the other hand, if the GHPfam results are similar to aspex, then 
the family structure being analyzed likely is affecting the 
results. 

Parametric lod score analysis was conducted by using 
FASTLINK, version 3.0 P [Schaffer, A. A., Gupta, S. K., Shriram 
K. & Cottingham, R. W., Jr. (1997); ftp://fastlink.nih.gov/ 
pub/fastlink]. Pairwise lod score calculations were done under 
a dominant model with 85% and 50% penetrance and under 
a recessive model with 85% penetrance. 

Because we have employed different methods of analysis 
under two phenotype definitions, the most significant signals 
in any specific region may be inflated because of multiple 
testing. Biases in each analysis caused by deviations from 
assumptions also could occur. We believe, however, that the 
consistency of results derived from different analytical meth- 
ods is important to consider, particularly if two or more 
methods detect lod scores of >2 in the same region. More- 
over, support from other studies on independently ascer- 
tained families is critical to strengthening the validity of a 
finding. 

RESULTS AND DISCUSSION 

Genome-Wide Screening Data Analyzed by Nonparametric 
Methods. We have screened 607 polymorphic loci covering all 
22 autosomes and the X chromosome, with an average marker 
spacing of -«6 centimorgans (cM) and an average heterozy- 
gosity of =«60%. Model-free calculations identified clusters of 
loci in several regions of the genome that displayed excess 
identical-by-descent (IBD) sharing. 

The strongest evidence for linkage derived from nonpara- 
metric analysis centered at chromosome 13q32, the only region 
that yielded a lod score of >3 (Table 1; Fig. 1). Pointwise 
analysis depicted excess IBD sharing in an '=«17-cM region 
maximizing at D13S1271 (64%; P = 0.0002) under ASM I. 
Analysis by GHPfam using nuclear families detected the highest 
multipoint lod at the D13S1252-D13S1271 interval (lod = 3.5; 
P = 0.000028) under ASM II. aspex localized the maximal 
peak in the D13S1271-D13S779 region for both ASM I (lod = 
3.4; P = 0.000039) and ASM II (lod - 3.3; P = 0.000051). 
Broadening the classification of affected phenotypes to include 
recurrent major depression did not have a significant effect on 
the lod score. Based on statistical criteria proposed by Lander 
and Kruglyak (21), these values approach significant linkage 
(lod = 3.6; P = 0.000022) for a genome scan. Consistent with 
this finding was an excess sharing in this region in the 97 bipolar 



Table 1. Multipoint linkage data derived through aspex 
lod of >1 {P < 0.01) 



Chromosome 


Region 


ASM 1 (P) 


ASM II (/>) 


Iq25-q32 


S471-S237 


1.78 (0.0021)* 




2pter 


S2976 


2.0 (0.0012) 


2p25-p24 


S1400-S1360 


1.11 (0.012) 


4pl6-p]4 


S2408-S2632 


1.77 (0.0022) 


5q33-q35 


S498-S408 


1.16(0.01) 


1.7(0.0026) 


llql3 


S1883-S913 


1.89 (0.0016) 


12q22-q24 


S1343-S2070 


1.24 (0.0084) 


13q32 


S1271-S779 


3.4 (0.000039) 


3.3 (0.00005 l)t 


14q24 


S1434-S65 


1.04 (0.014) 


14q24-q32.2 


S617-S1434 


1.23 (0.0087) 


18pll.2 


S1150-S71 


2.32 (0.00054) 


2.03 (0.0011) 


21q22.1 


S1254-S65 


1.85 (0.0018) 


22qll-ql3 


S689-S685 


1.26 (0.008) 


2.1 (0.00094) 



*GHPfam yielded lod = 2.67 (P = 0.00022) at D1S1660-D1S1678. 
TGHPfam yielded lod = 3.5 (P = 0.000028) at D13S1252-D13S1271. 
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Chromosome 13 (ASM I) 
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pedigrees of the National Institute of Mental Health Genetics 
Initiative (22). These data suggest a prominent role for a gene 



encoded by 13q32 in the genetic basis of affective disorder in 
these pedigrees. It is noteworthy that the susceptibility region 
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on 13q32 overlapped the most significant region in a schizo- pedigrees showed that there was at least one individual with 
phrenia series (23). Examination of the diagnosis in the current schizoaffective disorder or schizophrenia in 15 of the families. 
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Conceivably, the 13q32 gene is a predisposing factor for 
psychoses. 

Regions on chromosomes 1, 7, 18, and 22 showed excessive 
allele sharing, yielding maximal multipoint lods of >2 but <3 
(Table 1; Fig. 2), with values for Iq and ISp exceeding 
suggestive linkage (21). Pointwise analysis disclosed elevated 
IBD sharing, in an '=«30-cM region spanning Iq25-q42, which 
was more pronounced under ASM I. Addition of recurrent 
major depression to the stringent phenotypes, as in ASM II, 
may increase power, because it expands the number included 
in the affected category. On the contrary, this addition could 
have the opposite effect, because the genetic heterogeneity of 
the sample would be magnified. Multipoint analysis by GHPfam 
produced a peak of allele sharing at Iq25-q32 (ASM I lod = 
2.67; P = 0.00022). Complementing this finding was a study 
that showed a frequent fragile site on lq32 in bipolar patients 
(24). Together, these results raise the possibility that disrup- 
tion of a gene on lq32 contributes to overall susceptibility to 
bipolar disorder. 

The candidate region on chromosome 18 encompassed a 
large area surrounding the centromere (7, 8). We increased 
marker coverage to an average spacing of 3.2 cM and localized 
the highest peak between D18S1150 and D18S71 (ASM I lod = 
2.32; F = 0.00054) at 18pll.2 (25) by using aspex (Fig. 1). 
There was no significant discrepancy between the peak profiles 
for ASM I and ASM II. A telomeric, less prominent peak at 
-18pll.3 (ASM II GHP lod = 1.44; P = 0.005) and another 
small peak at «-18q21 (lod = «^1) also were noted. These weak 
signals may be relevant because an inversion involving 18pll.3 
and 18q21.1 seemed to cosegregate with affective disorder in 
some families (26), and 18q21 included a proposed region for 
bipolar disorder (27, 28). 

Additional multipoint aspex analysis was performed by 
using both male- and female-specific maps (data not shown). 
Both analyses generated peak height and location that were 
roughly similar to those found when the sex-averaged map was 
used (Fig. 1). A previous study postulated a parent-of-origin 
effect on bipolar disorder in this chromosome (27). We 
extended the analysis to calculate parental-specific lods by 
using sex-specific maps. Paternal allele sharing under ASM I 
was increased, consistent with a previous finding on these 
pedigrees (29). However, this increase seemed to be caused by 
a single large family in which the paternal haplotype was 
shared (data not shown). 

A recent study implicated 18plL2 in psychoses (30). Evi- 
dence for linkage and association in schizophrenia families 
increased when affective diagnoses were included in the af- 
fected category (30). The emergence of overlaps for bipolar 
disorder and schizophrenia loci on 13q32 (23) and 18pll.2 (30) 
is consistent with family studies that showed an increased risk 
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among first degree relatives of schizophrenia probands for 
schizoaffective and recurrent unipolar disorders (31, 32). 

Neither sibpal nor aspex detected a region of excess IBD 
sharing on chromosome 7. In contrast, ghp gave a lod of 2.08 
{P = 0.00099) under ASM II between D7S1799 and D7S501 at 
7q31. Coincident with this location was an area with elevated 
allele sharing in the bipolar series of the National Institute of 
Mental Health Genetics Initiative (33). The divergent results 
derived through aspex and ghp may be ascribed to random 
chance or to a rare susceptibility allele that is more likely to be 
detected when extended pedigrees are studied (20). 

On chromosome 22, the maximal signal spanned D22S689 
and D22S685 on 22qll-ql3 (ASM II ASPEX lod = 2.1; P - 
0.00094). In this region, microdeletions have been found in 
velo-cardio-facial syndrome, a dysmorphism associated with an 
enhanced prevalence of psychoses (34). Significant linkage to 
bipolar disorder (J. Kelsoe, personal communication) and 
support for a susceptibility locus in schizophrenia (23, 35) have 
been reported for this region of chromosome 22. 

We have generated evidence supporting proposed linkages 
(21) on chromosomes 4pl6 (see below), 12q23-q24, and 21q22 
(Table 1; Fig, 1). The 12q23-q24 region has been linked to 
Darier's disease (36), which was found to cosegregate with 
affective disorder (37), Screening of loci in the region identi- 
fied significant linkage (21) to bipolar disorder in Danish 
kindreds (6) and in an extended French-Canadian pedigree (N. 
Barden, personal communication). Confirmatory data were 
detected in our families (ASM II aspex lod = 1.24; P = 
0.0084). We have extended our earlier study on 21q22 (11) and 
found a broad region with excess sharing that included a 
proposed vulnerability locus (10), maximizing at D21S1254- 
D21S65 (ASM I aspex lod = 1.85; P = 0.0018). 

Nonparametric analysis detected other loci with marginal 
signals (lod of >1 but <2; Table 1; Fig. 2). Excess IBD sharing 
(lod - 1.77) was found on 5q33-q35 (Table 1), and by two point 
analysis under a dominant transmission (85% penetrance) 
D5S462 gave a lod of 1.8 (Table 2). Lod scores of >1 under J 
dominant model have been reported for markers mapping to 
this region (38). Excess sharing was displayed by a marker 
mapping to 5q35 in the National Institute of Mental Health 
Genetics Initiative bipolar pedigrees (39). The increase in IBD 
sharing on chromosome llql3 (lod ««1.9) seemed to be 
supported by parametric analysis, because D11S4076, located 
in this region, gave a lod of 1 .55 under a dominant transmission 
(Table 2). A cautious interpretation of these signals is war- 
ranted, because several methods of analysis under two phe- 
notype models were employed; therefore, signals could have 
arisen from random statistical fluctuations. 
Genome-Wide Screening Data Analyzed by a Parametric 



Method. Parametric calculations identified two loci giving lods 
Table 2. Pairwise parametric lod scores >1.5 (6 ^ 0.2) for the entire pedigree series 



Chromosome 


Locus 


lq32 


GATA124F08 


5q35 


D5S462 


7pl5-q22 


D7S492 


10q25 


D10S187 


Ilpl5-pl4 


D11S915 


Ilpl4-pl3 


D11S904 


Ilql2.3-ql3.2 


D11S4076 


13q32 


D13S1271 


14qn-ql3 


D14S1060 


14q32.1-q32.2 


D14S1434 


18pn.2 


D18S1353 


18pn.2 


D18S40 


21q22.1-p22.3 


D21S267 


21q22.3-qter 


D21S212 


Parametric analysis 


was done with 


penetrance. 





ASM 1 



ASM n 



2.37 (dam, 85% pen) 
1.80 (dom, 85% pen) 

1.84 (dom, 50% pen) 
1.62 (dom, 50% pen) 
1.55 (dom, 50% pen) 
2.06 (rec, 85% pen) 

1.79 (rec, 85% pen) 

1.78 (dom, 85% pen) 

1.57 (rec, 85% pen) 

1.79 (rec, 85% pen) 



.54 (dom, 85% pen) 
.59 (dom, 50% pen) 
.69 (rec, 85% pen) 



.69 (rec, 85% pen) 
.19 (rec, 85% pen) 



1.70 (dom, 50% pen) 



(see Methods), dom, dominant; rec, recessive, pen, 
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of >3 in the two largest pedigrees. Pedigree 0048 had 39 
members that were genotyped, with six affected under ASM I 
(14). On 4pl6, a region that showed linkage to bipolar disorder 
m Scottish families (5), D4S2632 gave a lod of 3,24 (0 = Q) for 
ASM I under a dominant, 50% penetrance model. Model-free 
analysis by aspex showed elevated allele sharing within the 
D4S2408-D4S2632 interval (ASM II lod = 1.77; P = 0.0022* 
Table 1; Fig. 2). Hence, these data support the reported 
linkage (5), and the gene on 4pl6 may confer a major effect 
on susceptibility in pedigree 0048. 

Pedigree 1482 had 30 members genotyped with 11 affected 
under ASM II (14). At D14S617 on 14q23-q32.1, this pedigree 
gave a lod of 3.19 (0 = 0; dominant transmission; 85% 
penetrance) under ASM II. Approximately 8 cM distal to 
D14S617 on 14q32.1-q32.2, D14S1434 displayed a total lod 
score of 1.79 for the entire pedigree series under ASM I and 
a recessive, 85% penetrance model (Table 2). aspex analysis 
displayed increased sharing in the region (ASM II lod = 1.23; 

^.^c?nP.^^' J^^^^ ^'S' ^ "^^'^ proximal location! 

pi4S1060, which maps to 14qll-ql3, a region involved in a 
balanced translocation in a patient with bipolar disorder (40) 
displayed a lod of >2 under ASM II and a recessive transmis- 
sion (Table 2). It is unclear whether these signals would be 
supported on further scrutiny of other pedigree series. 

Markers that produced parametric lods of >1.5 for the 
entire panel tended to reinforce findings from nonparametric 
analysis on lq32, 13q32, 18pll.2, and 21q22 (Table 2), even 
though parameters involved in the genetic inheritance of 
bipolar disorder are not known. GATA124F08 on lq32 yielded 
a lod of >2, assuming a dominant mode of transmission, and 
D13S1271 on 13q32 yielded a lod of >2 under a recessive 
model. Similarly, the peak heterogeneity lod in a schizophrenia 
series for a marker on 13q32 was found by using a recessive 
niodel (23). The highest parametric lods and IBD sharing on 
chromosome 18 were found in loci mapping to ISpll 2 (Fig 1) 
The distal portion of 21q also had markers with lods of >1 5 
under a recessive model, which is consistent with findings from 
model-free analysis. Despite the concurrence of signals de- 
rived from different analytical methods and some apparent 
corroborating evidence from other studies, verification of 
linkage requires additional confirmation from other pedigree 
series and must await cloning of genes manifesting variants that 
cosegregate with the affected phenotype. 

By implementing a high-density genome scan, we detected 
potential susceptibility loci for bipolar disorder on 13q32, 
lq32, and 18pll.2 and found support for proposed predispos- 
mg regions. Our data, coupled with independent findings 
suggest the existence of shared loci for bipolar disorder and 
schizophrenia and may provide evidence for the continuum 
model of psychosis (41). 

We thank the patients and their family members for their partici- 
pation in the study. 
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Bipolar disorder or manic depressive illness is a major psychiatric 
disorder that is characterized by fluctuation between two abnor- 
mal mood states. Mania is accompanied by symptoms of euphoria, 
irritability, or excitation, whereas depression is associated with 
low mood and decreased motivation and energy. The etiology is 
currently unknown; however, numerous family, twin, and adop- 
tion studies have argued for a substantial genetic contribution. We 
have conducted a genome survey of bipolar disorder using 443 
microsatellite markers in a set of 20 families from the general North 
American population to identify possible susceptibility loci. A 
maximum logarithm of odds score of 3.8 was obtained at D22S278 
on 22q. Positive scores were found spanning a region of nearly 32 
centimorgans (cM) on 22q, with a possible secondary peak at 
D22S419. Six other chromosomal regions yielded suggestive evi- 
dence for linkage: 3p21, 3q27, 5p15, lOq, 13q31-q34, and 21q22. 
The regions on 22q, 13q, and lOq have been implicated in studies 
of schizophrenia, suggesting the possible presence of susceptibility 
genes common to both disorders. 

Bipolar disorder is a severe psychiatric disorder that affects 
approximately 1% of the world's population (1). It is charac- 
terized by extreme swings in mood between mania and depression. 
Mama is accompanied by euphoria, grandiosity, increased energy 
decreased need for sleep, rapid speech, and risk taking. Depression 
is associated with low mood, low energy and motivation, insomnia, 
and feelings of worthlessness and hopelessness. Psychosis can occur 
m either state, and there is a 17% lifetime risk for suicide. 

The etiology is currently unknown, but epidemiological studies 
argue for a strong genetic component. Family studies indicate an 
approximately 7-fold increase in risk to first-degree family 
members (2). Twin studies find an average 4-fold increase in risk 
to monozygotic vs. dizygotic twins. The mode of genetic trans- 
mission IS unclear. Although some studies have supported the 
presence of autosomal dominant major loci (3, 4), it has also 
been argued that bipolar disorder is oligogenic with multiple loci 
of modest effect. 

Although initial attempts at linkage studies met with incon- 
sistent replication (5-8), more recently, the accumulation of 
multiple studies of larger family sets has led to the reproducible 
identification of several genetic loci. These include 4p, 12q, 13q, 
18, 21q, and Xq among others (9-15). We have previously 
reported on studies of our set of 20 North American pedigrees 
which indicated suggestive evidence of linkage to 5pl5 and 22ql 1 
(16, 17). We now report the results of a genome survey using 443 
microsatellite markers. These data provide strong evidence for 
a locus on 22q and support previously reported loci on several 
other chromosomes. Overlap for some of these loci with those 
reported for schizophrenia also provides additional data arguing 
for possible common susceptibility loci for these two disorders. 



facilities and through advertising and patient support groups 
from two sites: San Diego and Vancouver Families were ascer- 
tained through a proband with bipolar I or bipolar II disorder 
and included if at least two additional members were affected 
under our broad diagnostic model. Power analyses using SIM- 
LINK (18) were conducted to select both families and members 
for maximum power. The family sample included 20 families and 
164 subjects. Thirty-three subjects had a diagnosis of bipolar I, 
15 had bipolar II, and 28 had recurrent major depression There 
were averages of 8.2 subjects and 3.8 affected members per 
family. 

Family members were interviewed directly using the Structured 
Clinical Interview for DSM-3-R (SCID) (19) except for one family, 
the earliest in the study, which was interviewed using the Schedule 
for Affective Disorders and Schizophrenia (SADS-L) (20) All 
diagnoses were made using DSM-3-R criteria, modified to require 
a 2-day minimum duration for hypomania. Wherever possible, 
information was also obtained from other family informants and 
from medical records. Information from all sources was reviewed 
by a committee of experienced psychiatric clinicians to determine 
a consensus, best-estimate diagnosis. All interviewers underwent a 
rigorous, standardized training program for the SCID. Diagnostic 
reliability was regularly tested by review of videotaped interviews 
and was consistently high. 

DNA samples were obtained from the Coriell Institute for 57 
families with bipolar disorder, which were collected as part of the 
National Institute of Mental Health (NIMH) Bipolar Disorder 
Genetics Initiative. These families were used as a replication set 
and were genotyped for selected markers from our genome 
survey. These 57 families included 345 subjects with the follow- 
mg diagnoses: 169 with bipolar I disorder, 36 with bipolar II 
disorder, and 45 with recurrent major depression. Ascertainment 
and diagnostic methods for these families have been described 
elsewhere (21). 

Genotyping. After obtaining informed consent, blood was drawn 
on all subjects for the establishment of lymphoblastoid cell lines 
DNA was prepared from cultured cells by phenol/chloroform 
extraction. 

Markers were primarily tri- and tetranucleotide repeats from 
the Weber 6.0 screening set (22). These were supplemented with 



Methods 

Subjects. Families were ascertained from the general North 
American population through both systematic survey of clinical 
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dinucleotide repeat markers from the Genethon map (23). A 
total of 443 markers was examined. The average intermarker 
interval was 8 cM; the maximum interval between markers was 
33 cM. Map positions were derived primarily from the Marsh- 
field integrated map (24), except for the map used for multipoint 
analyses on chromosome 22, which was calculated from our 
genotypic data as described below. 

In total, 363 of the microsatellite markers were genotyped at 
the Novartis laboratory as follows. The PGR was performed in 
a volume of 20 fi] containing 100 ng of genomic DNA, 0.3 fxM 
of each primer (the forward primer had been labeled at the 5' 
end with fluorescein), dNTP at a concentration of 130 ^M each 
50 mM KGl, 1.5 mM MgGb, 10 mM Tris-HGI (pH 9.0), and 1 U 
Tag DNA polymerase (Amersham Pharmacia). The PGR am- 
plification was carried out in a Biometra Thermo Gycler with the 
following cycles: IX, 5 min at 95°G; 40x, 40 s at 95°G 40 s at 
55'G, and 40 s at 72°G; 1 x 7 min at 72°G. After completion of 
the amplification, 20 ^tl of loading buffer (99% of formamide/l 
mM EDTA/0.05% bromphenol blue/0.05% xylene cyanol were 
added to the PGR reaction. The samples were heated up for 3 
min at 95°G and then chilled on ice; then, 5 ^1 of each sample 
were loaded on a 5% to 8% Long Ranger polyacrylamide/7 M 
urea gel. The percentage of the gel was dependent on the size of 
the PGR product. The separated, fluorescently labeled PGR 
products were visualized by using the Molecular Dynamics 
Fluorlmager and ImageQuant software. The instrument scan 
parameters were adjusted to optimize signal intensity (900 volts, 
200-^m pixel size, normal sensitivity). 

The remainder of the markers were genotyped at the Uni- 
versity of Galifornia, San Diego, laboratory as follows: 50 ng of 
DNA was amplified in either a 5-fx\ reaction using an ABI 877 
catalyst robotic PGR workstation or a 20-/xI reaction using an MJ 
Research PTG-200 thermal cycler. Reactions contained 1 5 mM 
MgCl2. 50 mM KGl, 200-500 nM of each primer 10 mM 
Tris-HGI (pH 8.3), 0.001% gelatin, and 1 unitof either AmnliTan 
or AmpliTaq Gold (Perkin-Elmer). The forward primer was 
labeled with one of three fluors. After an initial 10 min at 95'*G 
(AmpliTaq Gold only), the PGR reaction was conducted using 
the touchdown protocol (25) in which the annealing temperature 
was decreased by V every two cycles from 65° to 55°, followed 
by a final 10 cycles at 55° PGR products were separated by 
electrophoresis and detected using an ABI 377 and Genescan 
and Genotyper software. Multiple markers labeled with differ- 
ent fluors and in different molecular weight ranges were pooled 
along with a molecular weight standard for multiplex detection 
of between 6 and 12 markers per lane. 

All genotypes from each lab were read in a machine-assisted 
fashion independently by two readers. Standard samples were 
used for checking consistency between gel molecular weights, 
and all data were screened for common artifacts using custom' 
data cleaning software. Genotypic data from each laboratory 
was transferred to University of Galifornia, Irvine, for linkage 
analysis. 

Statistical Analysis. A power analysis of this sample was first 
conducted to aid in the interpretation of results. For this 
simulation analysis, assumptions were made of an autosomal 
dominant genetically homogeneous trait and a marker with 4 
alleles of equal prevalence, which was 5 cM from the disease 
gene. Under the narrow disease model, an average logarithm of 
odds (lod) score of 3.55 was obtained, and there was 65% power 
to obtain a lod score >3.0. Under the broad model, the average 
lod score was 5.06, and there was 85% power to detect a lod 
score >3.0. 

Data were recoded to ensure straightforward notation for the 
multiple alleles in such a way as to allow direct comparison across 
the two laboratories. Allele frequencies for the markers were 
estimated from the families using MENDEL (26). These esti- 
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mates were then compared with published frequencies when the 
latter were available. Most of the estimates were comparable 
and if there were large discrepancies, the pedigrees were re- 
viewed for possible errors. No frequency discrepancies were of 
sufficient magnitude to warrant multiple runs using different 
estimates for the same marker(s). 

Linkage analyses were carried out using the parametric lod 
score approach to maximize power over the sample size avail- 
able. Galculations were performed both for disease vs. marker 
and then for marker vs. marker combinations in any area that 
gave some indication of being a potential disease gene location 
The marker mformation was used to confirm the map relations 
among markers within our data as compared with the published 
maps (Gooperative Human Linkage Genter, etc). One set of data 
errors for a single chromosome was detected using the map 
information, and markers were rerun and corrected. 

LIPED (27) was used to calculate the two-point lod scores as 
that program readily permits calculating the scores using differ- 
ent values for male and female recombination. We modeled the 
disease in three ways: (i) autosomal dominant with 0.85 pen- 
etrance (AD85), (ii), autosomal dominant with 0.50 penetrance 
(AD50), and (iii) autosomal recessive with 0.50 penetrance 
(AR50). Markers on the X chromosome were analyzed using a 
dominant model with 0.85 penetrance. Analyses were conducted 
under an assumption of genetic homogeneity. Preliminary anal- 
yses had indicated that similar results were obtained under 
heterogeneity as under homogeneity and that the sample did not 
have adequate power to detect heterogeneity. In addition we 
modeled the affected status for bipolar disorder because it is not 
yet certain which forms may represent the same genetic disease 
We defined affected for the narrow diagnostic model (BP only) 
as bipolar I plus bipolar II plus schizoaffective, bipolar type The 
broad diagnostic model (BP + RD) added the category of 
recurrent major depression to affected status. Subjects with 
other psychiatric diagnoses were considered of unknown affec- 
tion status. The three genetic and two diagnostic models resulted 
in SIX models that were considered. Penetrance and disease allele 
frequency was adjusted for each model so as to yield an 
approximately 5% phenocopy rate and disease prevalences of 
1% for the BP only model and 2% for BP + RD. An age-of-onset 
curve was included with minimal risk below age 15, maximizing 
at the defined penetrance at age 40. 

The lod scores were reviewed by chromosome region to detect 
inconsistencies. For example, one marker showed some evidence 
of Imkage, but the markers on either side were strikingly 
negative. Gareful review of the pedigrees indicated that the 
linked" marker was generated by data from very few informa- 
tive individuals who by chance had no recombinant offspring 
The markers on either side had data from many more informa- 
tive individuals, and numerous recombinants could be identified 
This area was discounted as a potential disease gene region 
After the inconsistency review, regions of interest were defined 
as those with an lod score greater than or equal to 1 0 To 
increase the density of the map in the regions of interest 
additional markers were genotyped. 

Three regions were investigated in depth with multipoint 
analyses using the FASTLINK implementation of the LINK- 
AGE package (28, 29). Two of these regions were on chromo- 
some 22, and one was on chromosome 5. We chose to use only 
those families that showed potential linkage in each region for 
the multipoint analyses. Only in one instance was there improve- 
ment in the lod score using the multipoint approach. 

Results 

The results of the genome survey are illustrated in Fig 1 Twelve 
markers yielded lod scores greater than 2.0 across the genome 
The region with the highest lod scores is on chromosome 22 
which IS further illustrated in Fig. 2. Thirteen markers spanning 
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32 cM yielded lod scores greater than 1.0. A maximum lod score 
of 3.84 was obtained at D22S278 on 22ql2. Six other markers 
within 5 cM also yielded lod scores greater than 1.0, including an 
lod score of 2.39 at D22S683, which is in close proximity to 
p22S278. These results were all obtained under the narrow 
diagnostic model and the high penetrance dominant genetic 
model. A multipoint analysis was conducted using the eight 
families with the highest two-point lod scores to further localize 
the linkage peak. This analysis was consistent with the two-point 
analysis and yielded a maximum lod score of 3.1 at D22S278. 
Outside of this immediate region, the next highest lod score was 
2.19 at D22S419, 15 cM proximal to D22S278. The length of this 
overall region of positive lod scores is consistent with other 
linkage findings in complex disorders (30). Although these 
results suggest a separate peak near D22S419, it is premature to 
estimate whether there might be multiple loci involved. 



In an attempt to examine these results in an independent 
sample, we examined 57 families from the NIMH Bipolar 
Disorder Genetics Initiative sample (21). This is a subset of the 
larger NIMH family set, which was selected based on size and 
mformativeness of pedigrees. As the results of the NIMH 
genome scan have already been reported, these results were not 
intended to provide a new sample for replication but simply to 
analyze the existing and available NIMH sample using the same 
markers and analytical methods as for our 20 pedigrees. The 
results, as illustrated in Table 1, were consistent with those from 
our genome scan. Eleven of the 16 markers examined on 
chromosome 22 yielded lod scores greater than 1.0. These 
positive lod scores spanned the same region implicated in the 
University of California, San Diego/University of British Co- 
lumbia (UCSD/UBC) sample. The maximum lod score in the 
NIMH sample was 2.72 obtained at D22S419. Six other markers 
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Max 



Locus 



D22S420 
D22S264 
D22S427 
D22S425 
D22S539 
D22S303 
D22S257 
D22S1174 
D22S315 
D22S1164 
D22S926 
D22S925 
D22S421 
D22S419 
D22S429 
D22S1144 
D22S689 
D22S684 
D22S693 
D22S691 
D22SliiJ 
D22S5ju 
D22S277 
D22S683 
D22S278 
D22S1142 
D22S283 
D22S692 
D22S1045 
D22S445 
D22S307 
D22S270 
ataSfSOS 
D22S274 

Fig. 2. Maximum lod score represents the highest two-point tod score 
obtained for each marker for all models examined. For each marker, the 
diagnostic and genetic models are indicated, as is the sex-specific recombina- 
tion fraction that yielded the maximum lod score. No lod score is indicated If 
no positive lod score was obtained under any model. 



within 2 cM also yielded lod scores greater than 1.0. D22S278 
which gave the strongest results in the UCSD/UBC sample' 
yielded a maximum lod score of 1.58. Three other nearby 
markers also obtained lod scores greater than 1.0. Therefore, for 
each of the two putative Hnkage peaks on 22q, positive scores 
were obtamed in both samples. However, the region near 
D22S419 was strongest in the NIMH sample, whereas the region 
near D22S278 was strongest in the UCSD/UBC sample. 

Table 2 details several other genomic regions that yielded 
suggestive evidence of linkage in the genome scan of the 
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UCSD/UBC family set. Each of these regions includes at least 
one marker with an lod score greater than 2.0, which is flanked 
by other markers with lod scores greater than 1.0. Two separate 
regions on chromosome 3 were implicated. Each region is 
approximately 30 cM long, and the regions are separated by 60 
cM. D10S1223 yielded an lod score of 2.27 and is flanked by 
several markers with lod scores greater than 1.0 over an interval 
of 20 cM. Another region with particularly suggestive data is 13q 
Three markers, D13S154, D13S225, and D13S796, generated lod 
scores greater than 2.0. Six other makers in this 20-cM interval 
also yielded lod scores greater than 1.0. On 21q, the marker, 
PFKL, yielded a lod score of 2.04. 

5pl5 is of particular interest because it includes an important 
candidate gene, the dopamine transporter. Although the com- 
bined family sample yielded only modest evidence of linkage 
one large family (family 16) generated suggestive evidence of 
linkage. As shown in Table 3, a maximum lod score of 2 77 was 
obtained for this family at D5S417. These results are consistent 
with those we have previously reported in the same family set 
with a less extensive set of markers (16). 

Several other regions deserve mention. Lod scores above 2 0 
were detected on chromosomes 2q and 12p. However the 
absence of any supporting evidence from immediately flanking 
markers caused us to dismiss the importance of these results 
Several other regions yielded modestly positive lod scores in the 
range of 1.0 to 2.0 and are interesting because of other reports 
in these regions. These include 4p, 5q, and 16p. 

Discussion 

We examined 443 microsatellite markers in a set of 20 North 
American families with bipolar disorder. The genome-wide 
maximum was a lod score of 3.8 at D22S278 on 22ql3 under the 
narrow diagnostic definition and an autosomal dominant model 
Other regions with suggestive evidence for linkage include 3d21 
3q27, 5pl5, lOq, 13q31-q34, and 21q22. ' 

We first reported suggestive evidence of linkage to 22q in 1997 
in a subset of this sample (31). This earlier study examined 13 of 
the 20 families reported here and obtained a maximum lod score 
ot 2.51 at D22S303. This report extends our results by examining 
an expanded number of markers in a larger family set. In the 
current sample, the maximum evidence of linkage is found 
^foPo™?'^^^*^ 20 distal to the original result; however, 
D22S303 is near the possible second peak of linkage reported 
here at D22S419. Our results are consistent with several other 
reports examining this region. The NIMH Bipolar Disorder 
Genetics Initiative consortium reported an lod score of 2 5 at 



Table 1. Two-point lod scores for bipolar disorder in 57 NIMH families 

Position Max LOD 

Locus (cM) score* 



D22S303 

D22S1174 

D22S315 

D22S92S 

D22S421 

D22S419 

D22S533 

02251144 

022S689 

D22S691 

D22S1ju 

D22S683 

D22S278 

D22S283 

D22S692 

D22S1045 



16.4 
19.3 
21.5 
21.5 
21.5 
21.5 
22.0 
27.5 
28.6 
32.4 
35.2 
36.2 
36.2 
38.6 
41.4 
42.8 



0.83 
0.86 
1.14 
2.2S 
1.12 
2.72 
1.17 
1.85 
2.17 
0.82 
1.46 
1.34 
1.58 
1.00 
0.28 
0.63 



Diagnostic 
model 



BP only 
BP only 
BP only 
BP only 
BP only 
BP only 
BP only 
BP only 
BP only 
BP only 
BP only 
BP only 
BP + RD 
BP only 
BP only 
BP only 



Genetic 
model 



AR 50 
AD 50 
AR 50 
AR 50 
AR SO 
AR 50 
AR 50 
AR 50 
AR 50 
AR 50 
AR 50 
AR 50 
AD 50 
AR 50 
AR 50 
AD 85 



0m 



*The maximum lod score obtained under any model tested. 



0.5 
0.5 
0.5 
0.5 
0.5 
0.5 
0.3 
0.5 
0.5 
0.5 
0.4 
0.4 
0.5 
0.4 
0.5 
0.5 



Of 



0.2 

0.2 

0.2 

0.1 

0.2 

0.1 

0.1 

0.05 

0.05 

0.1 

0,1 

0.1 

0.2 

0.2 

0.3 

0.3 
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Table 2. Two-point lod scores from other regions with suggestive evidence of linkage 
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*The maximum lod score obtained under any model tested. 



D22S533 using a multipoint sib pair analysis (32). This is 
approximately 1 cM from our secondary peak at D22S419. The 
57 NIMH families we examined are a subset of the NIMH 
Genetics Initiative pedigrees, and our results are consistent 
with theirs. These results on chromosome 22 are also consis- 
tent with a separate set of 21 families collected by the NIMH 
intramural program. Detera-Wadleigh et al. (11) reported a 
maximum lod score of 2.5 using a multipoint sib pair analysis in 
the interval between D22S689 and D22S685. Linkage maps 
position this region approximately halfway between the peaks at 
D22S419 and D22S278 found in our study. However, more 
recent genomic sequence and physical mapping data indicate 
that this region is more telomeric, approximately 2 Mb centro- 
meric to D22S278 (33). 

This 22q region has also been implicated in several studies of 
schizophrenia. Pulver et al. (34) first reported evidence of 
linkage of schizophrenia to 22q at the IL2RB locus near 



D22S278. Subsequently, this region has been implicated in 
several other linkage studies and the marker D22S278 in several 
linkage disequilibrium studies (35-38). Although it is this region 
near D22S278 that has been primarily reported in studies of 
schizophrenia, Myles-Worsley et al. have recently reported evi- 
dence of linkage of schizophrenia to a marker near our possible 
secondary peak at D22S419 (39). They examined a composite 
inhibitory endophenotype in families with schizophrenia and 
found a genome-wide maximum lod score of 3.5 at D22S315, 
which is very close to D22S419. Together, these data provide a 
considerable amount of evidence for loci for both bipolar 
disorder and schizophrenia on 22q. Furthermore, the data for 
both disorders suggest the possibility of more than one linkage 
peak on this chromosome. 

Other regions of suggestive evidence for linkage in our study 
are consistent with previous reports and provide supportive 
evidence for these loci. 13q has been implicated in a recent study 



Table 3. Two-point lod scores at 5p15 in family 16 



Locus 



D5S200S 

D5S678 

D5S1981 

DSS1970 

D5S417 

D5S675 

D5S405 

05S2088 

D5S1492 

D5S406 

D5S2054 

D5S464 

D5S2505 

D5S635 



Position 
(cM) 



1.72 
1.72 
1.72 
5.43 
6.67 
9.41 
9.41 
9.41 
9.41 

11.85 

14.3 

14.3 

14.3 

14.91 



Max lod 
score* 



2.08 
2.14 
1.89 
0 

2.77 
1.57 
1.64 
0.70 
2.02 
2.01 
1.84 
1.88 
1.51 
1.45 



Diagnostic 
model 



BP only 
BP only 
BP only 

BP + RD 
BP + RD 
BP only 
BP only 
BP + RD 
BP + RD 
BP only 
BP only 
BP only 
BP only 



Genetic 
model 



AD85 
AD85 
AD85 

AD8S 
AD85 
AD85 
AR85 
AD85 
AD85 
AD85 
AD85 
AD85 
AD85 



0.001 

0.001 

0.001 

0.5 

0.05 

0.05 

0.001 

0.001 

0.001 

0.001 

0.001 

0.001 

0.001 

0.001 



9f 



0.001 

0.001 

0.001 

0.5 

0.1 

0.5 

0.001 

0.001 

0.2 

0.001 

0.001 

0.001 

0.001 

0.001 



*The maximum lod score obtained under any model tested. 
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of bipolar disorder by Detera-Wadleigh et ai (11) who reported 
a lod score of 3.5 in this region. Our data represent a significant 
replication of these results (40). Like 22q, this region has also 
been imphcated in studies of schizophrenia (41). Evidence for 
hnkage of bipolar disorder to the PFKL locus on 21q was 
reported in 1994 by Straub et ai (14). Subsequently, this locus has 
been supported by several other studies, and our data provide 
further support for this result. Markers on lOq were implicated 
by the NIMH Genetics Initiative consortium who found evi- 
dence for linkage at D10S188, about 20 cM proximal to the 
positive region in our data. Linkage has also been reported in a 
German sample of bipolar families at D10S217 (S. Cichon 
personal communication). Two other groups have also reported 
evidence of linkage to schizophrenia in this region (42). Few 
studies of bipolar disorder have implicated chromosome 3; 
however, Edenberg et al have reported increased allele sharing 
at D3S3038 in studies of the NIMH Genetics Initiative for 
Bipolar Disorder sample (32). 

Our results must be qualified based on the number of different 
models examined. We chose a parametric approach because of 
Its greater power, and we chose to use a limited number of both 
diagnostic and genetic models to best cover the most likely 
modes of transmission. The complexity and non-independence 
of the models used make it difficult to estimate the exact 
significance of these results. Rather, the strength of our results 
must be mterpreted relative to other regions in our genome 
stjrvey and in the context of supporting data in our examination 
of the NIMH sample and other independent data sets. 

Three of the regions implicated in this study (22q, 13q, and 
lOq) have also been reported in studies of schizophrenia. Such 
a correspondence has also been reported for ISp and lOp (43). 
Although such an overlap could occur by chance or reflect two 
nearby but separate susceptibility genes, it also raises the in- 
triguing possibility that many susceptibility genes are common 
for the two disorders. Although family studies have generally 
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found these two disorders to "breed true," a small degree of 
familial overlap has frequently been observed. Recent family 
study data have further supported some degree of overlap (44) 
Only the identification of specific genes will resolve this question 
However, these data raise the hypothesis that some genes 
contribute to susceptibility in a nonspecific way or that different 
mutations in the same gene may predispose to different illnesses 
In summary, our results support the presence of a suscepti- 
bility locus for bipolar disorder on chromosome 22. They also 
provide support for regions previously reported on 5p, lOq, 13q 
and 21q and suggestive evidence for novel loci on 3p and 3q It 
IS intriguing that three of these regions have also been implicated 
m studies of schizophrenia. These molecular data raise the 
possibility that common susceptibility genes may be involved and 
that the relationship between these two disorders may be more 
complex than previously thought. 
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Methylation during Neuronal Differentiation of PC12 Cells 

Thomas R. Cimato,* Murray J. Ettinger,* Xianbo Zhou,* and John M. Aletta* 

*Department of Biochemistry, ^Department of Pharmacology and Toxicology, University at Buffalo School of Medicine and 
Biomedical Sciences, State University of New York, Buffalo, New York 14214 



Abstract Protein methylation is a posttranslational 
modification that can potentially regulate signal trans- 
duction pathways in a similar manner as protein phos- 
phorylation. The role of protein methylation in NGF 
signaling was examined by metabolic labeling of PC12 
cell proteins with L-[methyl-3H]methionine and by in 
vitro labeling of cell proteins with L-[methyl-^H]S-ade- 
nosylmethionine. Effects of NGF were detected within 
15 min. Methyl-labeled proteins were resolved by one 
and two dimensional SDS-PAGE. NGF affected the 
methylation of several 68-60-kD proteins (pi 5.8-6.4) 
and 50-kD proteins (isoelectric point pH 6.7-6.8 and 
5.8-6.2). Several NGF-induced changes in methylation 
increased over several hours and through 4 d. More- 
over, methyl labeling of several specific proteins was 
only detected after NGF treatment, but not in non- 



treated controls. The effects of NGF on protein methy- 
lation were NGF specific since they were not observed 
with EGF or insulin. A requirement for protein methy- 
lation for neurite outgrowth was substantiated with ei- 
ther of two methylation inhibitors: dihydroxycyclopen- 
tenyl adenine (DHCA) and homocysteine. DHCA, the 
more potent of the two, markedly inhibits protein 
methylation and neurite outgrowth without affecting 
cell growth, NGF-induced survival, cell flattening, or 
several protein phosphorylations that are associated 
with early signahng events. Removal of DHCA leads to 
rapid protein methylation of several proteins and con- 
current neurite outgrowth. The results indicate that 
NGF regulates the methylation of several specific pro- 
teins and that protein methylation is involved in neurite 
outgrowth from PC12 cells. 



PROTEIN methylation is a posttranslational modifica- 
tion that may be used to regulate signal transduc- 
tion and differentiation pathways by mechanisms 
that are analogous to regulation by protein phosphoryla- 
tion (Hrycyna and Clarke, 1993; Rando, 1996). Although a 
specific role for protein methylation in prokaryote chemo- 
taxis is well estabHshed (Shapiro et al., 1995), possible 
roles for protein methylation in eukaryotic signaling mech- 
anisms have not been extensively explored. Protein car- 
boxy! methylations are reversible, and regulatory roles for 
carboxyl methylation have been proposed for chemoat- 
tractant responses in neutrophils (Philips et al., 1993, 
1995), insulin secretion from pancreatic islets (Metz et al.,' 
1993), and photoreceptor signal transduction (Parish et al.i 
1995). Among signaling proteins that are known to be car- 
boxyl methylated are the Ras and Rho family of small 
G-proteins (Hrycyna and Clarke, 1993; Rando, 1996), y 
subunits of heterotrimeric G-proteins (Philips et al., 1993; 
Rando, 1996), and the catalytic subunit of protein phos- 
phatase 2A (Lee and Stock, 1993; Favre et al., 1994; Xie 
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and Clarke, 1994). Protein phosphatase 2A is also demeth- 
ylated by a specific protein carboxyl methylesterase (Lee 
et al., 1996). Many proteins are known to be N-methylated 
at arginine, lysine, or histidine residues including cytoskel- 
etal proteins (actin and myosin), nuclear proteins (nucleo- 
lin, fibrillarin, histones, heterogeneous nuclear RNPs), the 
multifunctional calcium binding protein, calmodulin, and 
FGF-2. The physiological functions of methylation, how- 
ever, remain largely unknown. Moreover, potential mech- 
anisms for regulating protein methylation within growth 
factor signaling pathways remain to be explored. 

Methylation pathways use S-adenosylmethionine (SAM)' 
as the universal methyl donor for methyltransferase-cata- 
lyzed methylation of proteins and other methyl acceptors. 
The role of methylation in NGF signal transduction was 
previously studied using high concentrations (millimolar) 
of inhibitors of methyltransferases that use SAM (Seeley 
et al., 1984; Kujubu et al., 1993). These attempts to exam- 



1. Abbreviations used in this paper: DHCA, 9-{trans-2', /ra/T5-3'-dihy. 
droxycyclopent-4'-enyl)-adenine; ERK, extracellular signal-regulated ki- 
nase; PAS, protein-A-Sepharose; 2D two dimensional; PI, phosphoinosi- 
tide; SAHcy, S-adenosylhomocysteine; SAHH, S-adenosylhomocysteine 
hydrolase; SAM, S-adenosylmethionine; Trk, tyrosine kinase receptor. 
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ine the role of methylation in cell signaling were somewhat 
compromised by the lack of specific, nontoxic inhibitors of 
methylation. In addition to the methylated acceptor, S-adeno- 
sylhomocysteine (SAHcy) is a product of all methyltrans- 
ferase reactions. SAHcy is a strong competitive inhibitor 
of SAM (Hildeshein et al., 1972) and is normally removed 
by hydrolysis in a reversible reaction (De la Haba and Can- 
toni, 1959) catalyzed by S-adenosylhomocysteine hydro- 
lase (SAHH). Thus, inhibition of SAHH offers an alterna- 
tive means to inhibit methyltransferases. This approach 
was used in the present work to study protein methylation 
during NGF signaling. The crucial role of SAHH and 
methylation in development is evident from the nonagouti 
(a-^) mutation. The lethality of this mutation is due to a de- 
letion of the SAHH gene, and embryonic development is 
arrested in the preimplantation blastula stage (Miller et 
al., 1994). 

The PC12 clonal cell line was used to examine the role 
of protein methylation in NGF signal transduction. PC12 
cells are derived from a rat pheochromocytoma and serve 
as a model of neuronal differentiation. When exposed to 
NGF, PC12 cells assume many of the features of sympa- 
thetic neurons including cell cycle arrest, survival in se- 
rum-free medium, and elaboration of long neurites (Greene 
and Tischler, 1976; Tischler and Greene, 1978). Moreover, 
PC12 cells also respond to other growth factors. For exam- 
ple, in the presence EGF, PC12 cells do not differentiate 
mto neuronlike cells, but do increase their mitotic rate 
(Huff and Guroff, 1981). Thus, PC12 cells are useful to 
study specific growth factor signaling mechanisms. The 
specific signaling events and proteins involved in neurite 
outgrowth have not been fully elucidated. For example, 
NGF and EGF both activate several signaling proteins in- 
cluding phospholipase C, phosphoinositide (PI)3-kinase, 
extracellular signal-related kinase (ERKs) (MAP ki- 
nases), and Ras (Kaplan and Stephens, 1994; Marshall 
1995). However, NGF, but not EGF, stimulates neurite 
outgrowth in PC12 ceils. One hypothesis for the specificity 
of NGF actions involves kinetic differences in the activa- 
tion of both Ras and ERK (Marshall, 1995). NGF stimula- 
tion of PC12 cells results in prolonged activation of both 
Ras and ERK activity, while EGF causes a transient rise 
and fall in Ras and ERK activities (Qui and Green, 1992). 
Specificity may also arise from NGF-specific signaling 
pathways that are not activated by EGF and other mito- 
gens (Chao, 1992; Kaplan and Stephens, 1994; Peng et al., 
1995). The results reported here indicate that signaling 
pathways involving protein methylation may contribute to 
NGF specificity. 

Previous studies by Seeley et al. (1984) have implied 
that methylation is necessary for neurite outgrowth in 
PC12 cells. Kujubu et al. (1993) and Haklai et al. (1993) re- 
ported that methylation of small G-proteins is regulated 
by NGF in PC12 cells. Najbauer and Aswad (1990) have 
also characterized several methyl-arginine containing pro- 
teins in PC12 cells. The effects of NGF on protein meth- 
ylation and the effect of inhibition of protein methylation 
on neurite outgrowth are, however, incompletely charac- 
terized and have been difficult to interpret due to the lack 
of specificity of the methylation inhibitors used in previous 
studies. The present work extends previous findings by ex- 
amining the methylation of specific cellular proteins and 



neurite outgrowth before and after inhibition of the pro- 
tein methylation pathway. 9-{trans-2\ trans-3' -dihydroxy- 
cyclopent-4-enyl)-adenine (DHCA), a specific, mechanism- 
based inhibitor of SAHH, inhibits protein methylation and 
greatly decreases neurite outgrowth. The results indicate 
that NGF, but not EGF or insulin, regulates the methyla- 
tion of several specific proteins and that protein methyla- 
tion IS required for neurite outgrowth from PC12 cells. 

Materials and Methods 
Reagents 

NGF was purified from male mouse submaxillary glands as described pre- 
viously (Mobley et al., 1976). Insulin, L-homocysteine thiolactone, aniso- 
mycm leupeptin, PMSF, aprotinin, and monoclonal phosphotyrosine anti- 
body (clone PT-66) were purchased from Sigma Chemical Co. (St. Louis 
MO). Homocysteme was prepared by incubating L-homocysteine thiolac- 
tone with 40 mM NaOH at 37''C for 30 min. EGF was obtained from Up- 
'J^if^j^'^'^^^^^^'^Sy I"^- (Lake Placid, NY). DHCA (kindly provided by 
R.T Borchardt, University of Kansas, Lawrence, KS) was prepared as 1 
and 100 mM stock solutions in DMSO. Erythro.9-(2-hydroxy-3-nonyI) ad- 
enme was kmdly provided by D. Porter (Burroughs Wellcome, Research 
IVr.^w^f"' ^-["iethyl-3H]methionine (7L4 Ci/mmol), L.[methyl- 
HjSAM (70 Ci/mmol), ["'Cjadenosine (59.8 mCi/mmoI), and carrier-free 
I Plorthophosphate were purchased from Dupont-NEN (Boston MA) 
I-donkey anti-rabbit IgG was purchased from Amersham Corp (Ar- 
lington Heights, IL). Protein A-Sepharose (PAS) and ampholines were 
obtamed from Pharmacia Fine Chemicals (Piscataway, NJ). Donor horse 
serum and fetal bovine serum were from JRH Biosciences (Lenaxa KS) 
Cell culture medium, penicillin, and streptomycin were obtained from 
Gibco Laboratories (Grand Island, NY). Restriction endonuclease Mspl 
was from New England Biolabs Inc. (Beverly, MA), and nuclease PI from 
United States Biochemical Corp. (Cleveland, OH). 



Cell Culture and Bioassays 



Stock cultures of PC12 cells were grown on collagen-coated tissue culture 
dishes (Falcon Plastics, Cockeysville, MD) in RPMM640 medium supple- 
mented wih 10% heat-inactivated horse serum and 5% fetal bovine se- 
rum at 35*C and 7.5% CO;, as previously described (Greene et al 1987) 
For neurite outgrowth studies, PC12 cells (3-5 X 10^ cells) were plated 
onto collagen-coated 35-mm plastic tissue culture dishes and cultured in 
RPMM640 medium containing 1% heat-inactivated horse serum and 50 
ng/ml NGF. Neurites >20 y.m were counted as neurite-bearing cells At 
least 100 cells from each experimental condition were scored from ran- 
domly chosen fields. 

.^^JJSf'^ "^""^^ proliferation and survival in the presence and absence 
of DHCA, naive PC12 cells were plated on collagen-coated, 24-well tissue 
culture dishes at a density of 10^ cells per well. 1 d after plating, cells were 
washed three times in RPMI-1640 medium to remove serum and then 
placed either in complete medium (RPMI plus 10% horse serum and 5% 
fetal bovine serum), in RPMI plus 50 ng/ml NGF, or in RPMI alone 
Where indicated, 1 DHCA was included in the culture medium The 
number of viable cells was determined by counting intact nuclei using a 
hemacytometer (Soto and Sonnenschein, 1985) immediately after washing 
the cultures with RPMI at time intervals of 1, 3, and 5 d later 
^J^V^^ regeneration, PC12 cells were treated with 

NGF for 7-14 d. NGF was then thoroughly washed away from the cells 
and the cells were detached from the culture dish by trituration Cells 
were then replated in the presence or absence of NGF, and neurite out- 
growth was scored 24 h later. 

Metabolic Radiolabeling of Proteins 

To assess endogenous protein methylation under various experimental 
conditions, PC12 cells were cultured in 1% horse serum for at least 16 h 
before metabolic radiolabeling for 6 h with 100 fiCi/ml L-fmethyl-^Hlme- 
thionme in the presence of 10 ^jlM anisomycin at 35T in a CO^ incubator 
The methionme/cysteine content of RPMI-1640 was reduced by 80% dur- 

'"v^/^fJi*'^''"^ P^"''''- ^'"""^^^ t^G^' EGF, or insulin) and/or 1 

|xM DHCA were added for the indicated times. At the conclusion of the 
labeling period, cells were washed three times with 1 ml of PBS (35**C), 
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harvested in 200 ^1 SDS-PAGE lysis buffer, and held in a boiling water 
bath for 5 min. Total radiolabeled protein in each lysate was determined 
by TCA precipitation and liquid scintillation spectrometry. The averages 
of the determinations are given under Results as the means ± SEM. Ali- 
quots of each cell lysate containing equal TCA-precipitable counts per 
mmute were subjected to SDS-PAGE as described below. For analysis of 
protein methylation by two-dimensional (2D) lEF x SDS-PAGE, the cells 
were harvested in lEF lysis buffer (Aletta and Greene, 1987) without 
heating the sample. 

To assess changes in protein phosphorylation, PC12 cell proteins were 
metabolically radiolabeled as previously described (Aletta. 1996) in a 
Hepes-buffered Krebs- Ringer solution containing 0.1% D-glucose. When 
the effect of DHCA was examined, the drug was added to cultures 30 min 
before addition of the radioisotope. The labeling was carried out with 50 
M-Ci/ml p2p]orthophosphate at dS^C in room air for 2 h. At the end of the 
labeling period, cells were washed three times with 1 ml of PBS (35°C) 
harvested in SDS-PAGE lysis buffer, and held in a boiling water bath for 
5 mm. Total radiolabeled protein in each lysate was determined by TCA 
precipitation and liquid scintillation spectometry. 

In Vitro Protein Methylation 

To measure protein methylation in subcellular fractions, cells were 
washed three times in PBS (4''C) and harvested by scraping with a rubber 
policeman in ice-cold homogenization buffer (100 mM Tris, pH 8.0, 1 mM 
^o^'^'^' ^ ^"'^ leupeptin). After homogenization at 

4 C m a Dounce homogenizer, the homogenates were centrifuged at 4T 
for 10 min at 500 5 to obtain a crude nuclear fraction. The supernatant was 
centrifuged at 20,000 g for 5 min at 4''C. The resulting supernatant was 
centrifuged at 100,000 g for 90 min at 4X to obtain the cytoplasmic frac- 
tion. The pellet of the 100,000 g spin was washed three times in homogeni- 
zation buffer and resuspended in 150 ptl of homogenization buffer to ob- 
tain the membrane fraction. Protein in each fraction was determined 
spectrophotometrically (Bradford, 1976). Equal amounts of protein (125 
M-g) from each fraction were incubated with 4.25 ^iCi L-fmethyl-^HlSAM 
in a total volume of 50 ^1 for 1 h at STC. The labeling reaction was 
stopped by adding 12.5 ^lI of 5x SDS-PAGE sample buffer (0.3 M Tris- 
HCl, pH 6.8, 45% glycerol, 1.4 M 2-mercaptoethanol, 10% SDS, 0.001% 
bromophenol blue) and holding the samples in a boiling water bath for 5 
min. In vitro-Iabeled proteins from the cell fractions were then analyzed 
by 7.5-15% gradient SDS-PAGE as described below 



Gel Electrophoresis 

Discontinuous SDS-PAGE (Laemmli, 1970) was performed with 19-cm 
separating gels composed of polya cry 1 amide gradients of 6-12, 7.5-15, or 
8.5-15%, depending upon the experiment. Gels were fixed, stained with 
Coomassie blue, and then destained. Gels containing proteins labeled 
with [ P]orthophosphate were dried and placed in contact with Kodak 
XAR film to produce an autoradiographic image. Gels containing tritium- 
labeled proteins were prepared for fluorography by washing the gel for 1 h 
m three changes of deionized water, followed by immersion in 1 M sodium 
salicylate for 30 min (Chamberlain, 1979). After drying, the gels were ex- 
posed to preflashed Kodak XAR film (Laskey and Mills, 1975) and stored 
at -70°C (Bonner and Laskey, 1974) with an intensifying screen. Quanti- 
tative comparisons of methyl-labeled proteins were obtained by scanning 
nuorograms into an analysis program (Molecular Analyst; Bio Rad Labo- 
ratories, Hercules, CA). 

For 2D lEF X SDS-PAGE, PC12 cell proteins were labeled as de- 
scribed above and harvested in a lysis buffer appropriate for lEF (Aletta 
and Greene, 1987). Equal TCA-precipitable counts per minute of cell ly- 
sates were subjected to lEF with pH 5-7, and 3.5-10 ampholines at a ratio 
of 4:1, respectively. Proteins in lEF gels were further resolved by SDS- 
PAGE (the second-dimension) using 12-cm separating gels composed of 
7.5-15% polyacrylamide gradient. A standard lEF gel containing only ly- 
sis solution was used to determine the pH range. Fluorographic images of 
tntmm-labeled proteins were generated as described above. 

DNA Methylation 

Total methylation of cytosines in DNA was determined by first digesting 
PCI 2 cell DNA with Mspl, which cleaves methylated or unmethylated 
CCGG sequences (Bestor et al., 1984). The products are 5' labeled with 
I PJATP, and digested with nuclease PI. Methyl-cytosines are resolved 
from unmethylated cytosines by chromatography in isobutyric acid/water/ 



ammonium hydroxide (66:33:1) and detected by autoradiography (Bestor 
et al., 1984). Liquid scintillation spectrometry was used to quantify the re- 
sults. 

Immunoprecipitation 

Tyrosine phosphorylation of Trk in the presence or absence of DHCA 
was assessed by immunoprecipitation followed by Western blotting Cells 
were treated with 100 ng/ml NGF for 5 min, with or without 1 ^iM DHCA 
Cells were then washed three times in ice-cold PBS and harvested in 1 ml 
of a lysis buffer containing 1% Triton X-100, 150 mM NaCI, 50 mM Tris- 
HCI, pH 8.0, 25 mM NaF, 5 mM EGTA, 5 mM EDTA, 2 ^iM PMSF and 
100 U/ml aprotinin. Insoluble material was removed by centrifugation at 
4°C for 10 min at 1 3,000 Samples were precleared with 6 mg PAS for 2 h 
followed by centrifugation for 10 min at 13,000 Anti-phosphotyrosine 
monoclonal antibody (clone PT-66) was added to equal amounts of lysate 
protem for 2 h on a rotating platform at 4X followed by addition of 3 mg 
of PAS and incubation on the rotating platform at 4'*C for 1 h. PAS beads 
were then recovered by centrifugation at 13,000 5 for 10 min, and washed 
three times with 1 ml of 1% Triton X-100 lysis buffer followed by two 1-ml 
washes in lysis buffer without Triton X-100. The PAS beads were then re- 
suspended in SDS-PAGE sample buffer and held in a boiling water bath 
for 5 mm. The precipitated material was resolved by SDS-PAGE (7 5% 
acrylamide) followed by transfer to Immobilon P membrane (Miilipore 
Corp., Milford, MA). The blot was probed with 1 ng of rabbit polyclonal 
Irk antiserum (Santa Cruz Biotechnology, Santa Cruz, CA) for 2 h at 
room temperature followed by incubation with '"I-donkey anti-rabbit 
IgG. The dried blot was then exposed to Kodak XAR film at -70°C with 
an intensifying screen. 



SAHH Activity 

PCI2 cells were plated on 150-mm dishes (2 x 10^ cells) in the presence of 
100 ng/ml NGF plus 10 nM to 3 DHCA, or without added DHCA for 
7 d. Cells were then washed three times in ice-cold PBS and harvested by 
scraping in a potassium phosphate buffer, pH 7.0 (25 mM KH2PO4 25 
mM K2HPO,, 1 mM dithiothreitol, 0.5 ^iM PMSF, and 10 jig/ml leupep- 
tm). Cells were homogenized in a Dounce homogenizer, and the cell nu- 
clei and debris were removed by centrifugation at 13,000 g. Cytosol was 
then prepared by centrifugation at 100,000 g for 90 min and the protein 
concentration determined. SAHH activity was determined in the synthesis 
direction by a TLC method described previously by Hershfield (1979). 

Results 

NGF-specific Induction of Changes in the Pattern of 
PC12 Cell Methylated Proteins 

Previous studies have implicated regulation of protein 
methylation in the early events of NGF-mediated signal 
transduction (Seeley et ah, 1984; Kujubu et al, 1993). To 
examine this possibility in greater detail, protein methyla- 
tion patterns were assessed after metabolic radiolabeling 
of cellular proteins with L-[methyl-3H]methionine. Fluoro- 
grams of the labeled proteins generated from SDS-PAGE 
and 2D lEF X SDS-PAGE were used to detect specific 
changes in protein methylation after NGF treatment. The 
cellular pool of SAM, the predominant methyl donor in all 
cells, was radiolabeled using L-[methyl-3H]methionine in 
control PC12 cells, and PC12 cells treated with either 50 
ng/ml NGF, 5 nM EGF or 1 ^xM insulin. These concentra- 
tions and growth factors were chosen based on the biologi- 
cal effects produced by each in PC12 cells. NGF at 50 ng/ 
ml produces the maximum neurite outgrowth response. 
EGF at 5 nM enhances cell proliferation (Huff et al., 1981) 
and 1 fxM insulin is commonly used to effect cell survival 
in serum-free media (Rukenstein et al., 1991). We have 
verified that receptors for all three of these growth factors 
are present on the PC12 cells used in these studies, by ob- 
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Figure L NGF induces specific changes in the pattern of in vivo 
protein methylation observed by 2D lEF x SDS-PAGE. PC12 cell 
proteins were metabolically labeled with L-[methyl-3H]methio- 
nine (100 M-Ci/ml) in the presence of anisomycin (10 pM) for 6 h. 
Whole cell iysates containing equal TCA-precipitable cpm 
(800,000) were separated by lER lEF gels were loaded onto 8.5- 
15% gradient gels for the separation of methylated proteins by 
molecular mass. Fluorographic images generated from the con- 
trol and 6 h NGF (50 ng/ml) conditions are illustrated in the top 
two panels. Molecular mass standards (in kD) are indicated at the 
left of each panel. The boxed area in the top panels is enlarged 
below for control, 6 h NGF and 6 h EGF (5 nM) samples. Fluoro- 
grams were generated by exposing x-ray film to the dried gels 
with an enhancing screen for 10 d at -70*C The pH gradient of 
the isoelectric focusing tubes is shown below the enlarged images 
{a~c). Labels {A, *, and +) indicate unaffected methylations, 
while the remaining letters and numbers indicate protein meth- 
ylations affected by NGF. These results are representative of 
three independent experiments. 



servation of the aforementioned biological effects at the 
doses indicated (data not shown). Radiolabeled methio- 
nine equilibrates with the cellular SAM pool within 20 min 
(Chelsky et al., 1985). A protein synthesis inhibitor, aniso- 
mycin (10 M-M), was present during the metabolic radiola- 
beling to prevent incorporation of L-[methyl-3H]methio- 
nine into newly synthesized proteins. 

In each of five independent trials, all three trophic fac- 
tors produced a net increase in methyl group incorpora- 
tion into PC12 cell protein. After 6 h of treatment, EGF 
and insulin lead to ~50% greater incorporation (±0.2, 
SEM) and NGF to a 31% increase (±0.1, SEM), Despite 
the slightly larger effects of EGF- and insulin-promoted 
increases in total methyl-^H-labeled protein, NGF treat- 
ment consistently yields protein methylation patterns ex- 
hibiting more pronounced changes in specific methylated 



proteins (Fig. 1). Several, indistinctly resolved proteins 
that migrated between 24 and 20 kD showed increased 
methyl labeling relative to total protein methylation after 
6 h NGF treatment (Fig. 1, top right). Among the meth- 
ylated species detected in control cell Iysates, were nine 
68-60-kD proteins with isoelectric point (pi) values of 5.8-6.4 
(Fig. 1, A-G, *, and +). After treatment with 50 ng/ml 
NGF for 6 h, the proteins labeled A, *, and + in Fig. 1 
showed little or no reproducible changes in methylation 
relative to controls. Although there was a net increase in 
total protein methylation, both increases and decreases in 
the methylations of specific proteins relative to total pro- 
tein methylation were detected in response to NGF. De- 
creased labeling of proteins D, and E was detected with 
NGF (Fig. 1 b). Methyl labeling of proteins C, F, and C in- 
creased markedly, and methyl group incorporation into 
two proteins (N and N') was detected only after 6 h in 
NGF-treated cells, but not in controls (Fig. 1 b). 

Significant NGF-induced effects on protein methylation 
were also detected at pi 6.7-6.8 and 5.8-6.2 in the 50-kD 
region (Fig. 1, « and b). Labeling of protein 3 was unaf- 
fected by NGF. Decreased methyl labeling of protein 1 
was detected after 6 h NGF treatment, while proteins 2, 4, 
and 7 showed increased methyl labeling, and the [methyl- 
^H] incorporation into protein 5 was only detected in 
response to NGF. The effects of NGF on protein methyla- 
tion are unlikely to be due to incorporation of L-[meth- 
yl-^H]methionine into newly synthesized proteins because 
protein synthesis was inhibited by 97 ± 0.1% (n = 3) un- 
der the conditions of these experiments. In addition, no 
significant increases or decreases in the amounts of the 
specific proteins analyzed in Fig. 1 were detected by 2D 
lEF X SDS-PAGE of p^SJmethionine-labeled proteins in 
whole cell Iysates, in the absence of protein synthesis inhi- 
bition (data not shown). 

The NGF-induced changes in methyl labeling of specific 
proteins were not observed with the two other trophic fac- 
tors tested. After EGF treatment for 6 h, the methylation 
pattern of the 68-60-kD proteins was quite different from 
the NGF-induced changes relative to the control, nonstim- 
ulated cells (Fig. 1). The pattern of protein methylations 
with EGF was more similar to the pattern with control 
ceils, but small increases in the methylation of proteins in 
this relative molecular weight range were reproducibly de- 
tected (B, C, D, and E). NGF, on the other hand, de- 
creased the methylation of proteins B, D, and E. The 
changes in protein methylations were less marked with 
EGF than NGF and no methylated protein was detected 
with EGF that was not detected with controls. Moreover, 
no significant effects of EGF were detected in the 50-kD 
set of methylated proteins. Insulin had no significant ef- 
fects on the methylation of the 68-60- or 50-kD proteins 
(data not shown). 

The effects of NGF on the pattern of protein methyla- 
tion were also examined by in vitro labeling of proteins in 
subcellular fractions from NGF-treated cells as an inde- 
pendent approach for detecting the protein methylations 
that are affected by NGF, In vitro labeling provides a com- 
plementary method of analysis that does not require a pro- 
tein synthesis inhibitor. It also indicates the possible cellu- 
lar locations of protein methylation. PC12 cells were 
incubated without or with NGF for 15 min, 6 or 16 h. Nu- 
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Figure Z NGF-treatment of intact cells stimulates in vitro pro- 
tein methylation in cell-free extracts. (A) PC12 cells were incu- 
bated in the presence or absence of NGF (50 ng/ml) for 6 or 16 h. 
Proteins in the nuclear (500^, pellet), membrane (100,000^, pel- 
let), or cytosolic (100,000 g, supernatant) fractions were radiola- 
beled with L-[methyl-3H]SAM for 1 h at 37°C. The labeling reac- 
tion was quenched by the addition of 5x SDS-PAGE sample 
buffer followed by holding the reaction tube in a boiling water 
bath for 5 min. Labeled proteins were separated by 7.5-15% gra- 
dient SDS-PAGE. The image shown is a fluorogram of the dried 
gel. Asterisks indicate methylated proteins not detectable in ex- 
tracts from non-NGF-treated control cells. Arrows identify in- 
creases of 67% or greater. The arrowheads (membrane fraction) 
pomt out decreases of >30%. These results were reproduced in 
an independent experiment. Specific changes at 16 h were as fol- 
lows: nuclear proteins at 97 and 64 kD increase 2.4-fold; cytosolic 
proteins at 50 kD and 35 kD increase 67 and 79%, respectively. 
(B) Cell proteins were metabolically radiolabeled followed by 
preparation of cytosol (in vivo) or the cytosol was prepared first, 
followed by incubation with [methyl-^HlSAM (in vitro). A por- 
tion of the 2D lEF x SDS-PAGE fluorograms from control 
i-NGF) versus 6 h NGF treatment {+NGF) are displayed. 
There are five individual protein spots migrating at 64 kD, which 
are most easily discerned in the in vivo +NGF condition. The 
64-kD series of spots in each of the other fluorograms (four spots 
each) are superimposable with those of the in vivo -f NGF condi- 
tion. 



clei (500 g\ membranes (100.000 g, pellet), or cytosol 
(100,000 g, supernatant) were isolated and incubated with 
[methyI-3H]SAM for 1 h at 37°C, in vitro. Methylated pro- 
teins were resolved by 7.5-15% gradient gel SDS-PAGE 
and detected by fluorography. Changes in protein methyl- 
ation were detected in nuclei, membranes, and cytosol 
(Fig. 2). Increases in protein methylation were detected af- 
ter 15 min of NGF treatment (data not shown) and contin- 
ued to increase from 6 to 16 h (Fig. 2 .4). Moreover, sev- 
eral of the proteins that showed increases in methylation 
have similar molecular weights as those detected by 2D 
lEF X SDS-PAGE (Fig. 1) after metabolic radiolabeling 
of intact cells (e.g., 68-64 kD, and 50 kD). Thus, changes 
in the methylation of proteins in response to NGF was 
confirmed by two independent approaches. In the nuclear 
fraction, time-dependent increases (more than twofold) in 
protein methylation in response to NGF treatment were 
detected in proteins migrating at 97, 94, 67, and 64 kD. In 
the membrane fraction, the methylation of proteins mi- 
grating at 50 and 34 kD decreased after 6 or 16 h of NGF. 
Several proteins in the cytosolic fraction showed increased 
methylation. Most prominent among these were 114, 94, 
50, and 35 kD. Additional NGF-induced effects were ob- 
served by the in vitro method. This is most likely due to 
the higher specific activity of the [methyl-^H]SAM pool in 
vitro than in the intact cell experiments and the enrich- 
ment of proteins by subcellular fractionation. In addition, 
the in vitro labeHng experiments indicate that both the 
requisite methyltransferase and protein substrate were 
present at the time of analysis in each of the subcellular 
fractions that were isolated from NGF-activated cells. 
Moreover, irrespective of the specific mechanisms respon- 
sible for activation of the methylation of specific proteins 
by NGF, the state of activation was stable during prepara- 
tion of the fractions. 

To validate this approach further, cytosol was prepared 
from cells treated with or without NGF after metabolic ra- 
diolabeling, and replicate cultures without metabolic ra- 
diolabeling were also processed to obtain cytosol for in 
vitro protein methylation. Comparisons of the 2D lEF X 
SDS-PAGE fluorograms from each type of preparation 
indicate that several of the same proteins resolved in the 
64 kD range (pi 5.8-6.4) are similarly increased after 
NGF treatment by either method of analysis (Fig. 2 B). 
The in vitro data obtained after NGF treatment for 6 h 
confirm that changes in protein methylation triggered by 
NGF can occur whether or not protein synthesis has been 
mhibited. Thus, the in vivo and in vitro experiments inde- 
pendently indicate that NGF produces diverse, marked ef- 
fects on the regulation of the methylation of several spe- 
cific proteins. 

NGF Affects Protein Methylation during Early and 
Delayed Signaling 

The time dependence of NGF-induced changes in protein 
methylation was examined to determine the onsets and 
durations of changes in protein methylation. PC12 cells 
were incubated with NGF for 15 min to 24 h to examine 
protein methylation during early stages of NGF signaling 
and for 4 d to examine changes occurring concurrently 
with the appearance of neurites. Total protein methylation 
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Figures. The time dependence of total methyl group incorpora- 
tion into proteins in response to NGF. PC12 cell proteins were 
metaboiically labeled with L-fmethyl-^HJmethionine (100 ^Ci/ 
ml) for 6 h in the presence of anisomycin (10 jjlM). NGF (50 nej 
ml) was added to the cell cultures for the times indicated. Cells 
were lysed in SDS-PAGE sample buffer. Total methyl labeling of 
protems (counts per minute) was measured for each condition af- 
ter TCA precipitation of an aliquot of the sample. The relative 
amount of the ^H-labeled methyl groups in lysate proteins at each 
time point is given with respect to nontreated control ceils The 
data shown are the means ± SEM for five independent experi- 
ments. ^ 



was assessed by the incorporation of pHJmethyl groups 
into TCA-precipitable protein during NGF treatment for 
24 h (Fig. 3). Significant increases in the total amounts of 
methyl group incorporation were detected within 15 min 
after the addition of NGF. The response over the early 
time course is biphasic with an initial rapid burst of methy- 
lation that decays between 1 and 4 h of NGF treatment 
followed by a second, persistent phase of elevated meth- 
ylation from 6 to 24 h. The pronounced decrease in methyl 
group incorporation between 1 and 4 h indicates that the 
effect of NGF on total protein methylation is likely to be 
complex and dependent upon multiple factors, including 
methionme uptake, intracellular compartmentalization 
and use of the labeled methyl group in other metabolic' 
pathways. Thus, all comparisons of protein methylation 
patterns examined by gel electrophoresis were standard- 
ized relative to total methyl-^H-labeled protein. 

Although the resolution of changes in protein methyla- 
tion m whole cell lysates is reduced on one dimensional 
SDS-PAGE gels, this method was used to observe the 
time course of changes in the set of proteins migrating at 
64-62 kD. Gradient SDS-PAGE and fluorographic analy- 
sis of methyl-^H-Iabeled proteins in whole cell lysates in- 
dicated that there are increases in the methylation of the 
proteins in this region of the gel, detectable within 1 h af- 
ter the addition of NGF and persisting for at least 8 h (Fig 
4 /1). In addition, when using an 8.5-15% acrylamide gra- 
dient, a reproducible NGF-specific change in a 34-kD pro- 
tein IS evident at 3 and 6 h of NGF treatment (Fig. 4 B) 
but not at times earlier than 2 h (data not shown). 

To determine if any of the changes in methyl group in- 
corporation persist for several days, the labeling studies 
were repeated in PC12 cells treated with NGF for 4 d. This 
time point was chosen because ->50% of PC12 cells have 




A Figure 4. Changes in the pat- 

{■) 15' ifi 2h All th w ^^^^ of protein methylation 
^ wm^m m^mm specific and time 

dependent. (A) PC12 cell 
^ proteins were metaboiically 

labeled with L-fmethyl-^HJme- 
thionine (100 jxCi/ml) for 6 h 
in the presence of anisomy- 
cin (10 ^M) with or without 
NGF (50 ng/ml) for the times 
indicated. Whole ceil lysates 
containing equal TCA-pre- 
cipitable cpm (300,000) were 
loaded in each lane. Proteins 
were separated by 7.5-15% 
gradient SDS-PAGE. The 
image shown is a fluorogram 
of the dried gel after expo- 
sure to x-ray film for 3 d at 
-70**C with an intensifying 
screen. The bracket indicates 
the migration position of a 
64-62-kD complex of proteins 
that increase in a time-depen- 
dent manner. The arrowhead 
indicates the migration posi- 
, , . ^ t'ori of an invariant protein 

band, the density of which does not change by more than a few 
percent over the time course. By this measure, the 4-h lane is 
slightly overioaded, and the 6-h lane, undedoaded. Nevertheless 
the change in the 64-62-kD protein complex is still evident in the 
latter. The results illustrated here are representative of five inde- 
pendent experiments. (B) Metaboiically radiolabeled methylated 
proteins (300,000 cpm) were obtained as described above and 
loaded on 8.5-15% polyacrylamide gradient gels to more favor- 
ably resolve the area between 29 and 36 kD. The image shown is 
a fluorogram derived from the dried gel after exposure to x-ray 
fi m for 2 d at -70X. The arrow indicates the migration position 
of a regu ated 34-kD protein. Cultures were treated with NGF 
(50 ng/ml), EGF (5 nM), or insulin (1 ^.M). The results shown are 
representative of three independent experiments 



neurites ^20-M.m long after 4 d of NGF (Greene et al 
1982). In addition, it was reasoned that some protein 
methylations may occur during neurite outgrowth that 
were not detected during early stages of NGF action 
Thus, after 4 d of NGF, methylated proteins were ana- 
lyzed by metabolic radiolabeling of cell proteins followed 
by 2D lEF X SDS-PAGE (Fig. 5) as described above. The 
pattern of PHJmethyl incorporation into PC12 cell pro- 
teins relative to total methylated protein indicates that 
many of the changes induced by 6 h of NGF (Fig. 1) persist 
for several days. Moreover, several of the NGF-induced 
increases and decreases in protein methylation observed 
after 6 h of NGF treatment were greater after 4 d of NGF 
treatment. Methylation of proteins between 68 and 60 kD 
(C, F, G, N, and N') and in the 55-50 kD region (2-4 6 
and 7) increased relative to both the control (Fie 5) and 
the 6 h NGF result (Fig. 1). Proteins C, F, G, N, and N' ap- 
pear to be the same proteins (based on pi and M,) that 
showed increases in methylation after 6 h of NGF treat- 
ment. Also, after 4 d of NGF treatment, the methylation of 
proteins B, D, and E, as well as protein 1 in the 50-kD re- 
gion decreased relative to the control or 6-h NGF result A 
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Figure 5. NGF-induced changes in protein methylation persist 
and progress during prolonged NGF treatment. PC12 cells were 
incubated with or without NGF (50 ng/ml) for 4 d. Cells were la- 
beled with L-[methyl-3H]methionine (100 jjiCi/ml) in the presence 
of anisomycin (10 ^M) for 6 h. Equal TCA-precipitable cpm 
(750,000) of cell lysates were subjected to lEF. lEF gels were 
loaded on 7.5-15% gradient SDS-PAGE gels to separate meth- 
ylated proteins by molecular mass. The images shown are fluoro- 
granis of the dried gels after exposure of x-ray film for 12 d at 
-70*'C with an intensifying screen. The areas of the gels shown 
are similar to the boxed areas shown in Fig. 1. The pH gradient of 
the lEF gels is indicated at the top of the figure. Labels are as in 
Fig. 1. 



number of novel methylated proteins were also observed 
after 4 d of NGF treatment that were not detected after 
6 h. These include proteins B', C' in the 60-kD region of 
the gel, and proteins 1\ 7", 8, and 9, appearing in the 50- 
kD region of the gel. The distinctly different time courses 
for the methylations of specific proteins suggest that NGF 
may regulate the methylation of proteins involved in mul- 
tiple signaling pathways. Moreover, most of the NGF- 
dependent changes in methylated proteins that were de- 
tected are not associated with transient phenomena. 

Inhibition of Methylation Inhibits Neurite Outgrowth 
The marked effects of NGF on the methylation of specific 
proteins calls attention to the possibility that protein 
methylation is required for NGF signal transduction and 
neurite outgrowth. This possibility was examined by ob- 
servmg NGF-mediated neurite outgrowth after inhibiting 
methylation using two approaches based on the biochemi- 
cal pathway outlined in Fig. 6. One approach was inhibi- 
tion of SAHH by a mechanism-based, substrate analogue 
inhibitor, DHCA (Liu et al., 1992). SAHH catalyzes the 
reversible hydrolysis of SAHcy to homocysteine and aden- 
osine by an NAD-dependent mechanism (De la Haba and 
Cantoni, 1959; Liu et al., 1992). The reaction is driven in 
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Figure 6. The methylation 
pathway. SAHH catalyzes 
the reversible hydrolysis of 
SAHcy. Inhibition of SAHH 
by DHCA or homocysteine 
leads to an accumulation of 
SAHcy and inhibition of 
methyltransferases. 



the hydrolysis direction in vivo by the adenosine deami- 
nase-catalyzed conversion of adenosine to inosine. DHCA 
blocks the methylation pathway by inhibiting SAHH 
Like the normal substrate, SAHcy, DHCA is oxidized 
by SAHH coupled to NAD+ reduction, but the 3-keto 
DHCA formed cannot be hydrolyzed, thereby trapping 
SAHH in its inactive, NADH form (Liu et al., 1992). 
SAHH inhibition results in an accumulation of its sub- 
strate, SAHcy, a potent competitive inhibitor of methyl- 
transferases (Hildeshein et al., 1972). 

The concentration dependence for inhibition of neurite 
outgrowth by DHCA (Fig. 7 A) was determined by incu- 
bating PC12 cells with NGF for 7 d with or without 10 nM 
to 3 fxM DHCA. Neurite outgrowth was scored after 7 d of 
NGF treatment as described under Materials and Meth- 
ods. The concentration of DHCA required to inhibit neu- 
rite outgrowth by 50% was ~100 nM. A second approach 
that was used to test for a requirement of protein meth- 
ylation in neurite outgrowth was inhibition of SAHH by 
addition of extracellular homocysteine, alone or in combi- 
nation with DHCA. When intracellular levels of homocys- 
teine rise, the SAHH-catalyzed reaction is driven in the 
synthesis direction (De la Haba and Cantoni, 1959- Ha- 
sobe et al., 1989), i.e., towards the formation of SAHcy 
(Fig. 6). Excess homocysteine acts cooperatively with in- 
hibitors of SAHH in many other cell types (Chiang et al 
1977; Kredich and Martin, 1977; Backlund et al., 1986; Ha- 
sobe et al., 1989), and the combination of the two is ex- 
pected to elicit more inhibition of methylation than either 
alone. As shown in Fig, 7 B, 100 nM DHCA reduced 
NGF-induced neurite outgrowth to 59% of control (NGF 
alone), and 100 \xM homocysteine reduced NGF-induced 
neurite outgrowth to 42%. The combination of 100 nM 
DHCA plus 100 \xM homocysteine inhibited the neurite 
outgrowth by more than 80%. The findings that homocys- 
teme alone inhibits neurite outgrowth, and that homocys- 
teine enhances the inhibitory effect of a low dose of 
DHCA on neurite outgrowth indicate that the effect of 
DHCA on neurite outgrowth is most likely due to inhibi- 
tion of methylation. 

Fig. 8 illustrates the effects of DHCA or DHCA plus 
homocysteine on cell morphology observed by phase con- 
trast microscopy of NGF-treated PC12 cells. After 7 d of 
NGF-treatment, PC12 cells flatten, hypertrophy, and elab- 
orate long neurites (Fig. 8 B). In the presence of 1 ^.M 
DHCA and NGF for 7 d, the cells continue to respond to 
NGF by flattening, but neurite outgrowth is greatly atten- 
uated (Fig. 8 C). Short, spike-like processes are observed. 
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Figure 7. Concentration-dependent inhibition of neurite out- 
growth by inhibitors of methylation. (A) Effect of DHCA. PC12 
cells on 35-mm collagen-coated tissue culture dishes (300 000 
cells per dish) were treated with NGF (50 ng/ml), or NGF and 
DHCA (10 nM to 3 m-M) for 7 d. Neurite- bearing cells were 
scored as described under Materials and Methods. Cultures 
treated with NGF and no DHCA received the DMSO vehicle 
only, which had no effect on neurite outgrowth. The line was fit 
by least squares regression analysis. The data shown are the 
means ± SD for three independent experiments. (B) Effect of 
homocysteine alone and in combination with DHCA All cul- 
tures were treated with NGF (50 ng/ml) and scored for the pres- 
ence of neurites after 7 d. The data shown are the means ± SD 
for three mdependent experiments. The white bar represents the 
NGF control condition without DHCA or homocysteine. Results 
from cultures treated with 100 nM DHCA are represented by the 
adjacent striped bar (DHCA), and those from cultures treated 
with 100 jiM homocysteine in the next striped bar (Hey) The 
cross-hatched bar indicates results from cultures treated with 100 
nM DHCA plus 100 fiM homocysteine (DHCA -^ Hey) 



In the presence of 100 nM DHCA plus 100 homocys- 
teine, the cells appear more flattened than control cells, 
but little neurite outgrowth is observed (Fig. 8 D) Thus' 
inhibition of methylation by DHCA, homocysteine, or the 
cooperative action of the two together suggest that meth- 
ylation IS an important requirement for NGF-induced neu- 



rite outgrowth. In addition, the morphological evidence 
(Fig. 8) indicates that not all NGF actions (e.g., the cell- 
flattening response) are inhibited by factors that inhibit 
methylation (see below). 

DHCA-induced inhibition of SAHH was verified by di- 
rect measurement of SAHH activity. DHCA at 1 nM pro- 
duced 70% inhibition and >90% inhibition was achieved 
at 10 nM. The consequence of this inhibition (elevated intra- 
cellular level of SAHcy) is expected to inhibit protein meth- 
ylation (Fig. 6). Thus, the effect of DHCA on protein meth- 
ylation was directly determined by assessing protein 
methylation in PC12 cells using metabolic radiolabeling 
with L-[methyl-3H]methionine in the presence of anisomy- 
cin. The dose-dependent inhibition of protein methylation 
observed is similar to that of the DHCA effect on neurite 
outgrowth (Fig. 9). DHCA (1 ^M) inhibits total methyl 
group incorporation measured by TCA-precipitation of 
equal aliquots of whole cell lysates as described under Ma- 
terials and Methods. The total reduction in radiolabeled 
methylated protein was 40% in control, non-NGF-treated 
cells, and 52% in cells stimulated with NGF for 6 h. Simi- 
lar differences in the inhibitory effects (±NGF) were ob- 
served at all doses of DHCA above 30 nM (data not 
shown). The inhibitory action of DHCA is thus greater in 
NGF-treated cells. In conclusion, these experiments indi- 
cate that protein methylation is markedly inhibited by 
DHCA at concentrations that inhibit neurite outgrowth 

In contrast, neither NGF alone nor DHCA with or with- 
out NGF have a significant effect on total methylation of 
cytosmes in PC12 cell DNA. The percentage of total cy- 
tosines methylated in NGF-treated cells (53.3 ± 1 5) ver- 
sus that in cells treated with NGF plus DHCA (48 1 ± 0 2) 
was not statistically significant (n = 3, Student's paired / 
test; P > 0.05, two tails). All measurements were per- 
formed after 24-h treatments. 

DHCA Treatment Does Not Interfere with Cell 
Proliferation or Early NGF-mediated Signaling Events 
Previous efforts to examine the possible role of methyla- 
tion in NGF signal transduction (Seely et al., 1984' Haklai 
et al., 1993; Kujubu et al., 1993) used methylation inhibi- 
tors at relatively high concentrations that can produce cy- 
totoxicity (Seeley et al., 1984) and inhibition of receptor 
tyrosine kinases (Meakin and Shooter, 1991; Maher 1993) 
The methylation inhibitor used in this study, DHCA is 
much less toxic than the related adenosine analogues used 
previously and is effective at concentrations four orders of 
magnitude lower than those used previously (Seeley et al 
1984; Meakin and Shooter, 1991; Haklai et al 1993- Ku- 
jubu et al., 1993; Maher, 1993). DHCA does not affect the 
growth rate of PC12 cells in serum-containing medium 
(Fig. 10 A). In the absence of serum, PC12 cells degener- 
ate via programmed cell death. DHCA has no effect on 
the rate that cell death is induced (Fig. 10 A) In the ab- 
sence of serum, NGF rescues PC12 cells from programmed 
cell death (Rukenstein et al., 1991). Also shown in Fig. 10 

vT^^?^^"^ "° ^^^^^^ protection afforded by 

NGF, In addition to confirming the absence of cytotoxic- 
ity, these findings suggest that DHCA does not interfere 
with the ability of NGF to activate its tyrosine kinase 
receptor (Trk), which prevents programmed cell death 
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Figure 8. The effects of DHCA and homocysteine on neurite outgrowth and cell morphology. PC12 cells were olated on 3*5 mm ml 



(Kaplan and Stephens, 1994). Thus DHCA has no detect- 
able effects on the growth or survival of PC12 cells under 
any of the conditions examined. 

Since other adenosine-analogues, which are putative 
methylation inhibitors, are known to inhibit phosphoryla- 
tion and activation of Trk (Meakin and Shooter, 1991; Ma- 
her 1993), the effect of DHCA on the tyrosine phosphory- 
lation of Trk was also determined. PC12 cells were 
incubated with or without 1 jjlM DHCA for 1 h, and NGF 
was then added for 5 min. Immunoprecipitation with a 
phosphotyrosine antibody was followed by SDS-PAGE 
and immunoblotting with anti-Trk antibody. DHCA had 
no detectable effect on the Trk tyrosine phosphorylation 
mediated by NGF (Fig. 10 5), the primary event in NGF 
signaling. 

To determine if DHCA affects other phosphorylation 
events downstream of Trk activation, total phosphopro- 
teins were analyzed by SDS-PAGE. Several NGF-induced 
increases in phosphorylation of proteins are resolved by 
this method (Fig. 10 C). The rapid increase in the phos- 
phorylation of tyrosine hydroxylase (Halegoua and Pat- 
rick, 1980) that occurs after NGF treatment is not affected 



by DHCA. The overall phosphorylation pattern was simi- 
larly unaffected by DHCA. Two other NGF-induced in- 
creases in phosphorylation, which require long-term (7 d) 
NGF treatment, are temporally associated with neurite 
outgrowth. Unlike phosphorylation of tyrosine hydroxy- 
lase, the increased ^^p incorporation into the 64-kD char- 
tin (Aletta and Greene, 1987) and p-tubulin (Aletta, 1996; 
Black et al., 1986) is diminished by exposure of the cells to 
DHCA. The inhibitory effect of DHCA on the phosphory- 
lations of these two proteins is the predicted consequence 
of DHCA inhibition of neurite outgrowth. Thus, DHCA 
affects delayed phosphorylation events in NGF signaling 
but does not inhibit phosphorylations associated with the 
early signaling events studied here. 

DHCA Effects on Neurite Outgrowth Are Diminished 
by prior NGF Treatment and Are Rapidly Reversible 
To explore further the functional significance of the time 
dependence of protein methylation during NGF-mediated 
neurite outgrowth, DHCA (1 ^jlM) was added to PC12 
cells at the same time that NGF was added or 1-6 d after 
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Figure 9. Dose-dependent effect of DHCA on total protein 
methylation in PC12 cells correlates with the DHCA effect on in- 
hibition of neurite outgrowth. Inhibition of NGF-stimulated neu- 
rite outgrowth (■) by DHCA was compared to the inhibition of 
total protein methylation (O) at the same doses of DHCA PC12 
cells were treated with DHCA (30 nM to 3 ^M) and NGF (50 ng/ 
ml) for 6 h in the presence of anisomycin to determine total pro- 
tein methylation as described under Materials and Methods. Inhi- 
bition is plotted relative to control cultures treated with NGF 
only. The neurite outgrowth data are replotted from Fig. 7 A, 



NGF addition. Under these conditions, the later DHCA 
was added after NGF, the less its inhibitory effect (Fig. 11 
A). The decrease in inhibition showed a Hnear dependence 
on the time at which DHCA was added to the cells. Once 
the NGF-signaling cascade begins, neurite outgrowth is 
progressively less sensitive to inhibition of methylation 
This would occur if the rates of demethylation were low 
for those methylated proteins that are required for neurite 
outgrowth. Alternatively, neurite outgrowth may be insen- 
sitive to inhibiting methylation once neurite outgrowth 
proceeds past a critical, committed step. While initiation 
of neurite outgrowth from PC12 cells after exposure to 
NGF IS slow (requiring 7 d to achieve ~100% neurite 
bearing cells), PC12 cells can rapidly regenerate after pre- 
formed neurites are disrupted as described under Materi- 
als and Methods. Fig. 11 B shows the concentration depen- 
dence of inhibition of neurite regeneration by DHCA. 
Neurite regeneration was inhibited 40% by 3 jjlM DHCA 
In contrast, if 3 ixM DHCA is used when NGF is first 
added to the cells, it inhibits neurite outgrowth by 84% 
(Fig. 7 A). As described above, the difference in the sensi- 
tivities of regeneration and de novo neurite outgrowth to 
inhibition by DHCA is consistent with slow rates of de- 
methylation or decreased sensitivity to DHCA once dif- 
ferentiation has reached a critical stage. 

The onset of neurite outgrowth after DHCA is removed 
was examined to compare the rate of reversing the effect 
of DHCA on neurite outgrowth to the rate of NGF- 
induced neurite outgrowth when DHCA had never been 
present. In addition, removal of DHCA was used to detect 
protein methylations that are associated with neurite out- 
growth during the reversal period. Reversibility of the ef- 
fects of DHCA on neurite outgrowth and protein methyla- 
tion was assessed by incubating PC12 cells with NGF and 
DHCA for 10 d, followed by removal of the DHCA Inter- 
estingly, the inhibitory effect of DHCA on neurite out- 
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figure JO. DHCA treatment 
does not interfere with cell 
growth, NGF-mediated sur- 
vival, or rapid onset NGF- 
stimulated phosphorylations. 
(A) PC12 cells were plated 
on 24-well collagen-coated 
dishes at a density of 100,000 
cells per well. Cells were al- 
lowed to attach to the dishes 
overnight, and then were 
washed with RPMI-1640 me- 
dium to remove serum. Cells 
were then cultured in com- 
plete (serum-containing) me- 
dium (CM, O and •), plus or 
minus 1 DHCA; RPMI- 
1640 without serum plus 50 
ng/ml NGF, plus or minus 1 
M-M DHCA (NGF, □ and ■); 
RPMI-1640 medium without 
serum or NGF, plus or minus 
1 M-M DHCA (No serum, A 
and A). Similar results were 
observed in two other inde- 
pendent experiments. (B) Ef- 
fect on Trk phosphorylation. 
PC12 cells were treated with 
or without NGF (50 ng/ml) 
for 5 min. Where indicated, 1 
|xM DHCA was added to 
cultures for 1 h before treat- 
ment with NGF. Cell lysates were equalized for total protein 
immunoprecipitated with anti-phosphotyrosine antibody (as de- 
scnbed under Materials and Methods), resolved by 7.5% SDS- 
PAGE and transferred to Immobilon P membranes The blot 
'^kk''T25^^ "^'^^ anti-Trk antibody, followed by donkey anti- 
rabbit I-IgG. The image shown is an autoradiogram of the 
Western blot after a 5-d exposure at -80°C This result was veri- 
fied in a second independent experiment. (C) PCI 2 cells were 
treated with or without NGF (50 ng/ml) for 2 h. DHCA (1 n M) 
was added 1 h before the addition of NGF where indicated Cells 
were radiolabeled with [^^pjorthophosphate for 2 h as described 
under Materials and Methods. For long term NGF-induced 
phosphorylations, PC12 cells were grown in the presence of NGF 
(50 ng/ml) plus or minus 1 ^M DHCA for 10 d. Cells were then 
radiolabeled with P^pjorthophosphate for 2 h. For both short 
and long term NGF treatment experiments, cell lysates were 
equalized for total TCA-precipitable ^^p.iabeled protein and 
separated on 6-12% gradient SDS-PAGE. The image shown is 
an autoradiogram of the dried gel. Exposure time was 16 h The 
arrows at the right indicate, in decreasing A/, the positions of the 
previously identified phosphoproteins, 64-kD chartin, 60-kD ty- 
rosine hydroxylase, and 55-kD P-tubulin. 
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Figure 11. The effect of prior 
NGF treatment on the neurite 
inhibiting action of DHCA. 
{A) The effect of varying the 
time interval between NGF 
treatment and addition of 
DHCA. PC12 cells were 
treated with NGF (50 ng/ml) 
for 7 d. DHCA (1 ^M) was 
added to the cells either at 
the time NGF was initially 
added to the cell cultures 
(time zero), or after a time 

Cells were scored for the presence of neurites after 7 d nf mhp Th^ ^ ♦ u . interval of 1, 2, 3, 4, 5, or 6 d. 

ments. (B) The effect of inhibition of meTh^a ioJ on neur te SLe^^^^ " for three independent experi- 

was removed from the cultures by an extensive washinrnrS^rH K ^ . 1 "^"^ "S^""') '-'^ d. NGF 

the culture dishes by trituration aL replated without of w th DHCA'rTll "^^ M^tfr-t'^ ^^"^ '^-^'>^-<' ^om 

Neurite regeneration was scored 1^24 hE The liL wr J.ri^^rln?. '"'^''^ ^" ^ ^ ^f^^ P'^ting. 

for three independent experiments ^ '''8'"'"°" '"^'y^'^' ^he data shown are the means ± SD 
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growth was rapidly reversed (Fig. 12 A). No lag was ob- 
served before an increase in neurite outgrowth was 
detected. The t^^i for reversal was 12 h (Fig. 12 A) The 
rate of neurite outgrowth when DHCA is removed is 
therefore, about eightfold faster than the normal Un for 
neurite outgrowth when NGF is added to naive cells 
Thus, the NGF-dependent signaling events that prime the 
cells (Burstein and Greene, 1978; Greene et al., 1982) for 
neurite outgrowth were not inhibited greatly by the meth- 
ylation inhibitor, DHCA. 

Upon removal of DHCA, the rapid onset of neurite out- 
growth was paralleled by rapid reversal of its inhibitory ef- 



fects on the methylation of several proteins. Proteins that 
increase by 25% or more in methylation upon removal of 
DHCA are indicated by the arrows, brackets, and aster- 
isks shown in Fig. 12 B. The molecular weights of proteins 
that showed increased methylation migrate at 150 120 94 
70/68, 50, 48/47, 40, and 35 kD. The time dependencies for 
methylation were different for different proteins. These 
results, together with the time course (Figs. 4 and 5) inhib- 
itor studies (Figs. 7-10), and specificity of the^NGF- 
induced effects on protein methylation (Fig. 1) indicate 
that regulation of protein methylation is required for neu- 
rite outgrowth. 



B Figure 12. Reversal of 

OHCA lOd ^ -9h -tan .24h -«h DHCA-induced inhibition of 

methyltransferases results in 
rapid neurite outgrowth and 
a time-dependent increase in 
protein methylation. (.4) PC12 
cells were plated on 35-mm 
collagen-coated tissue culture 
dishes at a density of 300,000 
cells per dish in NGF (50 ng/ 
ml) plus or minus 1 p,M 
DHCA for 7 d. The medium 
was then changed repeatedly 
to remove DHCA, and neu- 
rite outgrowth was scored 
and plotted over a 3-d pe- 
riod. Controls were main- 
tained in DHCA/NGF (A) 
and NGF alone (•). The ex- 
perimental condition was 

NGF for 7 d, followed bv NGF alone fnr ^ H {m\ tu^ a^* , treatment with DHCA and 

present in cui'tures .reated^Th NGF a,ot.VhiL?h^^^^^^^ -urite bearing cells 

the reversal data is the best fit to a single exponential function w th a ttfof 72 h if e^n^ expenmen.s. The line shown for 

was also assessed by SDS-PAGE. (S) Cells were harvested afteT 0 d of DHrA/^r f , , '""T'' P™'"'" "'^'hylation pattern 
lowed by progressively longer times Ih) in the absrce of DHC A Pen l . T^"' °' ^ DHCA/NGF treatment fol- 

minute and separated on 71,5% grad en, SDS PAGE T^^^^^^ 1°' "^CA-precipitable counts per 

arrows and brackets indicate the pLitions of orotel, rhJ, in.l!/ h '^^^"""^"f am of the dried gel. Exposure time was 5 d. The 

Asterisks mark the position of proTeTn Zds nTd eVtSi^^^^^^^^^^^^^ SI"" "Th'h'^'^^ X '"^ ' '^"^^ 

replicated in four independent experiments. ^^"^es noted dunng the reversal period were 
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Discussion 

The Subset of Methylated Proteins that 
Are NGF-regulated 

This work extends previous studies (Seeley et al 1984* 
Kujubu et al., 1993) of the role of methylation in growth 
factor signaling mechanisms and further demonstrates the 
feasibihty of exploring this role by metabolic radiolabeling 
of specific proteins in intact cells and by in vitro labeling of 
proteins in cell fractions. The results indicate that protein 
methylation is involved in the NGF-induced signal trans- 
duction that leads to neurite outgrowth. Methylation of 
numerous proteins was detected by metabolic radiolabel- 
ing indicating that protein methylation is an ongoing, ac- 
tive process. NGF specifically affects the methylation of a 
subset of these proteins. Further evidence for a significant 
role for protein methylation in neurite outgrowth is the 
marked concurrent inhibitory effects of DHCA on protein 
methylation and neurite outgrowth. Moreover, removal of 
DHCA rapidly leads to concomitant increases in protein 
methylation and neurite outgrowth. The identities and 
functions of the NGF-regulated proteins remain to be de- 
termined. It is likely that some of the proteins identified by 
this approach represent previously unidentified players in 
NGF signal transduction. This is significant because spe- 
cific protein modifications that are temporally coincident 
with neuronal differentiation may be useful in the elucida- 
tion of the mechanism of action of NGF (Chao, 1992; 
Kaplan and Stephens, 1994). Thus, signal transduction 
pathways may be regulated in part by regulating the meth- 
ylation of specific proteins in much the same manner as 
protein phosphorylation regulates signaling pathways. 

Potential Actions of Protein Methylation During NGF 
Signal Transduction 

The data presented here focus on NGF actions on protein 
methylations rather than methylation of phospholipids, 
RNA or DNA. Methylation of genes serves as a general 
signal for repression of transcription (Razin and Cedar, 
1991). Hypomethylation of the genome is associated with 
cellular differentiation in vitro (Bestor et al., 1984). There 
were no significant effects of NGF and/or DHCA on total 
DNA cytosine methylation in PC12 cells. Thus, the potent 
mhibitory action of DHCA on neurite outgrowth is un- 
likely to be due to an effect on DNA methylation. Previ- 
ous work also indicates that NGF does not detectably af- 
fect phospholipid methylation (Ferrari and Greene, 1985). 
These authors further demonstrated that complete'inhibi- 
tion of phospholipid methylation is achieved at concentra- 
tions of deaza-adenosine and homocysteine thiolactone 
that do not block neurite outgrowth. 

There are several potential mechanisms through which 
NGF-induced changes in protein methylation may affect 
biological responses. Gene regulation may be affected, 
particularly if the modified proteins are involved in gene 
expression. For example, many heterogeneous nuclear 
RNPs that associate with pre-mRNAs are methylated on 
arginine residues (Liu and Dreyfuss, 1995). Modulation of 
RNA splicing has been proposed as one potential target 
for protein methylation in signal transduction (Lin et al., 
1996). Consistent with this view, a protein arginine meth- 



yltransferase activity appears to be constitutively associ- 
ated with the type I interferon receptor (Abramovich et 
al., 1997). A role for protein methylation in cell signaling is 
further substantiated by the finding that methylation of 
the y subunit of a heterotrimeric G-protein enhances acti- 
vation of Pl-specific phospholipase C and PI3-kinase in 
vitro (Parish et al., 1995). If this biochemical effect occurs 
in intact cells, protein methylation could play an important 
general role in the amplification of intracellular signals. 
Protein methylation may also be involved in altering the 
subcellular localization of specific proteins, as recently 
suggested for the cell cycle-dependent methylation of pro- 
tein phosphatase 2A (Turowski et al., 1995). 

Neurite outgrowth requires a fundamental reorganiza- 
tion of the cytoskeleton including formation of intermedi- 
ate filaments and parallel bundles of microtubules ori- 
ented longitudinally in neurites, as well as specialized actin 
microfilament arrays within motile growth cones. Inhibi- 
tion of neurite outgrowth by DHCA may be related to in- 
hibition of protein methylation, which affects the cyto- 
skeleton. For example, Ras and Rho proteins, modified by 
isoprenylation and methylation, are involved in redistribu- 
tion of actin microfilaments during changes in cell shape 
(Vojtek and Cooper, 1995). Recently, Luo et al. (1996) 
presented evidence that constitutively active Racl in trans- 
genic mice interferes with the normal formation of Pur- 
kinje cell axons and dendrites. Since only the methylation 
step of small G-protein processing is reversible (Rando, 
1996), methylation is more likely than other processing 
steps to be subject to physiological regulation. NGF- 
induced increased methyl labeling of G-proteins in PC12 
cells (Haklai et al., 1993; Kujubu et al., 1993) and the spe- 
cific proteins observed in this study indicate that NGF may 
serve as one mediator of regulatory control in responsive 
neurons. Furthermore, Klein and colleagues have raised 
the possibility that protein methylation plays an important 
role in neurulation (Coelho and Klein, 1990; Moephuli et 
al., 1997) and neurite outgrowth in rat embryo cultures 
(Coelho and Klein, 1990). The present work lends addi- 
tional support to this possibility, especially with regard to 
neurite outgrowth. 



DHCA Is a Potent and Nontoxic Inhibitor of 
Protein Methylation 

DHCA is a promising tool for investigating the possible 
roles of methylation in signal transduction mechanisms. 
This inhibitor was developed by Liu and colleagues to ob- 
tain a more specific, less toxic inhibitor of SAHH than pre- 
viously used inhibitors (Liu et al., 1992). The 50% effec- 
tive dosage (ED50) for the effect of DHCA on neurite 
outgrowth for PC12 cells (-100 nM) is four orders of mag- 
nitude lower than previously used inhibitors (Seeley et al., 
1984). Inhibition of neurite outgrowth by >~85% is diffi- 
cult to achieve, even at the highest concentration of the in- 
hibitor. Pretreatment of cells with DHCA for several 
weeks before NGF stimulation, however, does produce 
nearly complete inhibition even at 7 d of NGF (Cimato, 
TR., M.J. Ettinger, and J.M. Aletta, unpublished obser- 
vations). These observations provisionally indicate that 
basal, ongoing protein methylation may serve an auxiliary 
role in neurite outgrowth in addition to the NGF effects 
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on protein methylation. As expected, DHCA potently in- 
hibits SAHH activity (Hasobe et al., 1989) and conse- 
quently leads to decreased total protein methylation The 
dose-dependent inhibition of protein methylation ob- 
served IS quite similar to that for inhibition of neurite out- 
growth (Fig. 9). The cooperative effects of homocysteine 
and DHCA on neurite outgrowth further substantiate a 
specific effect on methylation. The finding that total pro- 
tein methylation was inhibited more when NGF was 
present than when absent, suggests that under the influ- 
ence of NGF, many of the potential methyltransferases in- 
volved in the NGF response may be more sensitive to inhi- 
bition due to increased levels of SAHcy. This may occur 
because, as a result of increased total protein methylation 
(Fig. 3), mtracellular levels of SAHcy are expected to be 
elevated even before DHCA treatment. NGF has no mea- 
sureable effect on SAHH activity (Cimato, T.R., M.J Et- 
tinger, and J.M. Aletta, unpublished results). 

DHCA is unique among methylation inhibitors because 
of Its lack of effect on cell flattening, phosphorylation of 
^rosine hydroxylase, and tyrosine phosphorylation of 
Trk. Less specific methylation inhibitors, used to study the 
role of methylation in PC12 cells previously (Seeley et al 
1986; Kujubu et al., 1993; Maher, 1993; Meakin and 
Shooter, 1995), blocked each of these actions. The most 
remarkable property of DHCA in comparison to previ- 
ously used inhibitors is that DHCA inhibits neurite out- 
growth without inhibiting the NGF priming that prepares 
PC12 cells for neurite outgrowth (Greene et al., 1982) 
This is best illustrated by the rapid reversal of DHCA ef- 
fects on neurite outgrowth, which is in marked contrast to 
that observed with other methylation inhibitors. Reversal 
of inhibition by 5'-deoxy-S-methyl adenosine requires the 
same 7 d as required by control PC12 cells when first ex- 
posed to NGF (Seeley et al., 1986). Thus, DHCA is clearly 
more specific than previously used methylation inhibitors 
DHCA specifically affects neurite outgrowth without af- 
fecting many other signaling effects of NGF, including 
those associated with priming. 

Protein Methylation in Early and Delayed 
NGF Signaling 

Growth factors in general, promote biological responses 
that are either extremely rapid (within minutes) or require 
many hours or days to develop fully. The diversity of sig- 
naling networks (Pawson, 1985; Hunter, 1995) and the 
propensity of specific pathways to transmit and/or sustain 
a given signal (Marshall, 1995) may help to explain the dif- 
ferent temporal characteristics of responses. The results of 
this study and other recent work (Metz et al., 1993; Philips 
et al., 1993, 1995) indicate that posttranslational modifica- 
tion of proteins by methylation offers another potential 
mechanism for regulating both early and delayed growth 
factor signaling. 

The importance of early and delayed responses to NGF 
in PC12 cells, dorsal root ganglion, and sympathetic neu- 
rons IS well documented (Greene, 1984). One of the most 
obvious examples of a delayed NGF response is neurite 
outgrowth. Little neurite outgrowth is evident in PC12 
cells treated with NGF for 12-24 h. 50% of the cells extend 
neurites within 4 d, and nearly all have neurites by 7 d 



NGF-mediated neurite outgrowth requires transcription 
(Burstein and Greene, 1978; Greene et al., 1982) The long 
latency is apparently due to the progressive accumulation 
of specific proteins required for neurite assembly during 
the priming period. This accumulation may depend on 
protein synthesis (Burstein and Greene, 1978) and/or pro- 
gressive posttranslational modifications of specific pro- 
teins (e.g., as shown here for the methylation of specific 
proteins). Since regeneration of neurites from PC12 cells 
previously treated with NGF occurs much more rapidly 
(withm 24 h) than initiation (7 d); the longer time required 
for neurite outgrowth in the latter is thought to be due to 
the "priming" events. 

When interpreted in light of the priming model of NGF- 
induced neurite outgrowth, the experiments that used 
DHCA provide important insights regarding the temporal 
nature of protein methylation involved in this NGF re- 
sponse. Inhibition of protein methylation by DHCA is 
most effective in reducing neurite formation when the 
DHCA IS present concomitantly with NGF treatment 
There is decreased inhibition of neurite outgrowth as the 
time interval between NGF treatment and the addition of 
DHCA increases (Fig. 11 A). DHCA is also less effective 
at inhibiting neurite regeneration (Fig. 11 B). These re- 
sults imply progressive accumulation of the methylated 
protein product(s) necessary for neurite outgrowth as was 
detected at 4 d of NGF treatment (Fig. 5). Slow turnover 
ot methylated proteins is consistent with the reduced ef- 
fect of DHCA when added after NGF signaling has been 
initiated. 

The time dependencies of specific protein methylations 
the effects of DHCA on protein methylation, and the lack 
of effects of DHCA on early protein phosphorylations and 
priming are consistent with little or no interdependence 
between protein methylation signaling events and early 
priming events. Moreover, the data presented on the phos- 
phorylation of Trk and other proteins that lie downstream 
of Trk activation (Fig. 10, B and C) further substantiate 
the view that some cellular signaling events are more sus- 
ceptible to interference from inhibition of protein methy- 
lation than others. Autophosphorylation of Trk and the 
rapid increase in the phosphorylation of tyrosine hydroxy- 
lase (Halegoua and Patrick, 1980) are not affected by 
DHCA. Two other NGF-induced increases in phosphory- 
lation, which require long term (^7 d) NGF treatment for 
maximal phosphorylation, are temporally associated with 
neurite outgrowth. In accord with DHCA-mediated inhi- 
bition of neurite outgrowth, but unlike the rapid early 
phosphorylation events, the increased ^2? incorpoVation 
into the 64-kD charfin (Aletta and Greene, 1987) and 
P-tubulin (Black et al., 1986; Aletta, 1996) is diminished by 
exposure of the cells to DHCA. 

It can thus be concluded that DHCA does not disrupt all 
early NGF signaling events and priming actions, nor does 
It abrogate many of the biological effects of NGF signaling 
fr^lf}^"^' ^'S- ^' flattening). The effects 

ot DHCA on neurite outgrowth and inhibifion of the 
phosphorylation of neurite-associated proteins (p-tubulin 
and chartin) are consistent with a selective inhibitory ac- 
tion on an NGF pathway responsible for neurite out- 
growth. Based on all of the above results, the protein 
methylation required is likely to occur downstream from 
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or parallel to, rapid NGF actions and priming events. This 
interpretation does not exclude the possibility that protein 
methylation of rapid onset also plays an important role in 
neurite outgrowth. Once priming is achieved, NGF stimu- 
lation of some of the rapid protein methylations observed 
in this study may assume added significance, particularly if 
they occur at the growth cone. 
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